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(57) Abstract 

The present invention relates to a method for the rapid and reliable detection of drug-selected mutations in the HIV protease gene 
allowing the simultaneous characterization of a range of codons involved in drug resistance using specific sets of probes optimized to function 
together in a reverse-hybridization assay. More particularly, the present invention relates to a method for determining the susceptibility to 
antiviral drugs of HIV viruses in a biological sample, with said method comprising: a) if need be, releasing, isolating or concentrating the 
polynucleic acids present in the sample; b) if need be amplifying the relevant part of the protease gene of HIV with at least one suitable 
primer pair; c) hybrydizing the polynucleic acids of step a) or b) with at least one of the following probes: probes specifically hybridizing 
to a target sequence comprising codon 30; probes specifically hybridizing to a target sequence comprising codon 46 and/or 48; probes 
specifically hybridizing to a target sequence comprising codon 50; probes specifically hybridizing to a target sequence comprising codon 
54; probes specifically hybridizing to a target sequence comprising codon 82 and/or 84; probes specifically hybridizing to a target sequence 
comprising codon 90; or the complement of said probes ; further characterized in that said probes specifically hybridize to any of the target 
sequences presented in figure (1), or the complement of said target sequences; d) inferring from the result of step c) whether or not a 
mutation giving rise to drug resistance is present in any of said target sequences. 
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METHOD FOR DETECTION OF DRUG-SELECTED MUTATIONS 
IN THE HIV PROTEASE GENE 

5 1 . FIELD OF THE INVENTION 

The present invention relates to the field of HIV diagnosis. More particularly, the present 
invention relates to the field of diagnosing the susceptibility of an HIV sample to antiviral drugs used 
to treat HIV infection. 

10 The present invention relates to a method for the rapid and reliable detection of drug-selected 

mutations in the HIV protease gene allowing the simultaneous characterization of a range of codons 
involved in drug resistance using specific sets of probes optimized to function together in a reverse- 
hybridization assay. 

15 

2. BACKGROUND OF THE INVENTION 

The human immunodeficiency virus (HIV) is the etiological agent for the acquired 
immunodeficiency syndrome (AIDS). HIV, like other retroviruses, encodes an aspartic protease that 

20 mediates the maturation of the newly produced viral particle by cleaving viral polypeptides into then- 
functional forms (Hunter et al). The HIV protease is a dimeric molecule consisting of two identical 
subunits each contributing a catalytic aspartic residue (Navia et al, Whodawer et al, Meek et al). 
Inhibition of this enzyme gives rise to noninfectious viral particles that cannot establish new cycles of 
viral replication (Kohl et al, Peng et al). 

25 Attempts to develop inhibitors of HIV- 1 protease were initially based on designing peptide 

compounds that mimicked the natural substrate. The availability of the 3-dimensional structure of the 
enzyme have more recently allowed the rational design of protease inhibitors (PI) using computer 
modeling (Huff et al, Whodawer et al). A number of second generation PI that are partially peptidic or 
entirely nonpeptidic have proven to exhibit particularly potent antiviral effects in cell culture. 

30 Combinations of various protease inhibitors with nucleoside and non-nucleoside RT inhibitors have also 
been studied extensively in vitro. In every instance, the combinations have been at least additive and 
usually synergistic. 

In spite of the antiviral potency of many recently developed HIV-1 PI, the emergence of virus 
variants with decreased sensitivity to these compounds has been described both in cell culture and in 
35 treated patients thereby escaping the inhibitory effect of the antiviral (Condra et al). Emergence of 
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resistant variants depends on the selective pressure applied to the viral population. In the case of a 
relatively ineffective drug, selective pressure is low because replication of both wild-type virus and any 
variants can continue. If a more effective drug suppresses replication of virus except for a resistant 
variant, then that variant will be selected. Virus variants that arise from selection by PI carry several 

5 distinct mutations in the protease coding sequence that appear to emerge sequentially. A number of these 
cluster near the active site of the enzyme while others are found at distant sites. This suggests 
conformational adaptation to primary changes in the active site and in this respect certain mutations that 
increase resistance to PI also decrease protease activity and virus replication. 

Amongst the PI, the antiviral activity of the PI ritonavir (A-75925; ABT-538), nelfmavir (AG- 

1 0 1 343), indinavir (MK-639; L735 ; L524) and saquinavir (Ro 3 1-8959) have been approved by the Food 
and Drug Administration and are currently under evaluation in clinical trials involving HIV-infected 
patients. The VX-487 (141 W94) antiviral compound is not yet approved. The most important mutations 
selected for the above compounds and leading to gradually increasing resistance are found at amino acid 
(aa) positions 30 (D to N), 46 (M to I), 48 (G to V), 50 (I to V), 54 (I to A, I to V), 82 (V to A, or F, or 

15 I, or T), 84 (I to V) and 90 (L to M). Other mutations associated with drug resistance to the mentioned 
compounds have been described (Schinazi et af). Saquinavir-resistant variants, which usually carry 
mutations at amino acid positions 90 and/or 48, emerge in approximately 45% of patients after 1 year 
of monotherapy. Resistance appears to develop less frequently with higher doses of saquinavir. 
Resistance to indinavir and ritonavir requires multiple mutations; usually at greater than 3 and up to 1 1 

20 sites, with more amino acid substitutions conferring higher levels of resistance. Resistant isolates usually 
carry mutations at codons 82, 84, or 90. In the case of ritonavir, the mutation at codon 82 appears first 
in most patients. Although mutant virions resistant to saquinavir are not cross-resistant to indinavir or 
ritonavir, isolates resistant to indinavir are generally ritonavir resistant and visa versa. Resistance to 
either indinavir or ritonavir usually results in cross-resistance to saquinavir. Approximately one third 

25 of indinavir resistant isolates are cross-resistant to nelfinavir as well. 

The regime for an efficient antiviral treatment is currently not clear at all. Patterns of reduced 
susceptibility to HIV protease inhibitors have been investigated in vitro by cultivating virus in the 
presence of PL These data, however, do not completely predict the pattern of amino-acid changes 
actually seen in patients receiving PL Knowledge of the resistance and cross-resistance patterns should 

30 facilitate selection of optimal drug combinations and selection of sequences with non-overlapping 
resistance patterns. This would delay the emergence of cross-resistant viral strains and prolong the 
duration of effective antiretroviral activity in patients. Therefore, there is need for methods and systems 
that detect these mutational events in order to give a better insight into the mechanisms of HIV 
resistance. Further, there is need for methods and systems which can provide data important for the 

35 antiviral therapy to follow in a more time-efficient and economical manner compared to the conventional 
cell-culture selection techniques. 
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3 . AIMS OF THE INVENTION 

It is an aim of the present invention to develop a rapid and reliable detection method for 
5 determination of the antiviral drug resistance of viruses, which contain protease genes such as HIV 
retroviruses present in a biological sample. 

More particularly it is an aim of the present invention to provide a genotyping assay allowing 
the detection of the different HIV protease gene wild type and mutation codons involved in the antiviral 
resistance in one single experiment. 
10 It is also an aim of the present invention to provide an HTV protease genotyping assay or method 

which allows to infer the nucleotide sequence at codons of interest and/or the amino acids at the codons 
of interest and/or the antiviral drug selected spectrum, and possibly also infer the HIV type or subtype 
isolate involved. 

Even more particularly it is an aim of the present invention to provide a genotyping assay 
15 allowing the detection of the different HIV protease gene polymorphisms representing wild-type and 
mutation codons in one single experimental setup. 

It is another aim of the present invention to select particular probes able to discriminate wild- 
type HIV protease sequences from mutated or polymorphic HTV protease sequences conferring resistance 
to one or more antiviral drugs, such as ritonavir (A-75925; ABT-538), nelfinavir (AG-1343), indinavir 
20 (MK-639; L735; L524), saquinavir (Ro 31-8959) and VX-478 (141W94) or others (Shinazi et a!). 

It is more particularly an aim of the present invention to select particular probes able to 
discriminate wild-type HIV protease sequences from mutated or polymorphic HIV protease sequences 
conferring resistance to ritonavir (A-75925; ABT-538). 

It is more particularly an aim of the present invention to select particular probes able to 
25 discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
resistance to nelfinavir (AG- 1 343). 

It is more particularly an aim of the present invention to select particular probes able to 
discriminate wild-type HIV protease sequences from mutated HTV protease sequences conferring 
resistance to indinavir (MK-639; L735; L524). 
30 It is more particularly an aim of the present invention to select particular probes able to 

discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
resistance to saquinavir (Ro 31-8959). 

It is more particularly an aim of the present invention to select particular probes able to 
discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
35 resistance to VX-478 (141 W94). 

It is also an aim of the present invention to select particular probes able to determine and/or infer 
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cross-resistance to HIV protease inhibitors. 

It is more particularly an aim of the present invention to select particular probes able to 
discriminate wild-type HIV protease from mutated HIV protease sequences involving at least one of 
amino acid positions 30 (D to N), 46 (M to I), 48 (G to V), 50 (I to V), 54 (I to A or V), 82 (V to A or 
5 F or I or T), 84(1 to V) and 90 (L to M) of the viral protease gene. 

It is particularly an aim of the present invention to select a particular set of probes, able to 
discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
resistance to any of the antiviral drugs defined above with this particular set of probes being used in a 
reverse hybridization assay. 
10 It is moreover an aim of the present invention to combine a set of selected probes able to 

discriminate wild-type HIV protease sequences from mutated HIV protease sequences conferring 
resistance to antiviral drugs with another set of selected probes able to identify the HIV isolate, type or 
subtype present in the biological sample, whereby all probes can be used under the same hybridization 
and wash-conditions. 

15 It is also an aim of the present invention to select primers enabling the amplification of the gene 

fragment(s) determining the antiviral drug resistance trait of interest. 

It is also an aim of the present invention to select particular probes able to identify mutated HTV 
protease sequences resulting in cross-resistance to antiviral drugs. 

The present invention also aims at diagnostic kits comprising said probes useful for developing 
20 such a genotyping assay. 

The present invention also aims at diagnostic kits comprising said primers useful for developing 
such a genotyping assay. 



25 4. DETAILED DESCRIPTION OF THE INVENTION. 

All the aims of the present invention have been met by the following specific embodiments. 

According to one embodiment, the present invention relates to a method for determining the 
susceptibility to antiviral drugs of HIV viruses in a biological sample, with said method comprising: 
30 a) if need be, releasing, isolating or concentrating the polynucleic acids present in the sample; 

b) if need be amplifying the relevant part of the protease gene of HIV with at least one suitable primer 
pair; 

c) hybridizing the polynucleic acids of step a) or b) with at least one of the following probes: 

probes specifically hybridizing to a target sequence comprising codon 30; 
35 probes specifically hybridizing to a target sequence comprising codon 46 and/or 48; 

probes specifically hybridizing to a target sequence comprising codon 50; 
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probes specifically hybridizing to a target sequence comprising codon 54; 
probes specifically hybridizing to a target sequence comprising codon 82 and/or 84; 
probes specifically hybridizing to a target sequence comprising codon 90; 
or the complement of said probes, 
5 further characterized in that said probes specifically hybridize to any of the target 

sequences presented in figure 1 , or to the complement of said target sequences: 
d) inferring from the result of step c) whether or not a mutation giving rise to drug resistance is 
present in any of said target sequences. 
The numbering of HIV- 1 protease gene encoded amino acids is as generally accepted in literature. 
10 Mutations that give rise to an amino acid change at position 48 or 90 are known to confer resistance to 
saquinavir (Erlebe et al; Tisdale et al). An amino acid change at codon 46 or 54 or 82 or 84 results in 
ritonavir and indinavir resistance (Kempf et al; Emini et al; Condra et al). Amino acid changes at 
positions 30 and 46 confer resistance to nelfinavir (Patick et al) an(| amino acid changes at position 50 
confers resistance to VX-487 (Rao et al). Therefore, the method described above allows to determine 
1 5 whether a HIV strain is susceptible or resistant to any of the drugs mentioned above. This method can 
be used, for instance, to screen for mutations conferring resistance to any of the mentioned drugs before 
initiating therapy. This method may also be used to screen for mutations that may arise during the course 
of therapy (i.e. monitoring of drug therapy). It is obvious that this method may also be used to determine 
resistance to drugs other than the above-mentioned drugs, provided that resistance to these other drugs 
20 is linked to mutations that can be detected by use of this method. This method may also be used for the 
specific detection of polymorphic nucleotides. It is to be understood that the said probes may only partly 
overlap with the targets sequences of figure 1, table 2 and table 3, as long as they allow for specific 
detection of the relevant polymorphic nucleotides as indicated above. The sequences of figure 1, table 
2 and table 3 were derived from polynucleic acid fragments comprising the protease gene. These 
25 fragments were obtained by PCR amplification and were inserted into a cloning vector and sequence 
analyzed as described in example 1 . It is to be noted that some polynucleic acid fragments comprised 
polymorphic nucleotides in their sequences, which have not been previously disclosed. These novel 
polymorphic nucleotide sequences are represented in table 4 below. 

30 TABLE 4: Polymorphic nucleotide sequences. 



51 


52 


53 


54 


55 


56 


57 


58 


codon position 


gga 


ggt 


ttt 


ate 


aaa 


gta 


aga 


cag 


consensus sequence 


GGA 


GGT 


TTT 


ATC 


AAA 


GTC 


AGA 


CAA 


SEQ ID NO 478 


GGA 


GGT 


TTC 


ATT 


AAG 


GTA 


AAA 


CAG 


SEQ ID NO 479 


GGA 


GGT 


TTT 


ATT 


AAG 


GTA 


AGA 


CAG 


SEQ ID NO 480 
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GGA 


GGT 


TTT 


ATT 


AAA 


GTA 


AGA CAA 




GGA 


GGC 


TTT 


ATC 


AAA 


GTA 


AGA CAA 




GGA 


GGT 


TTT 


ATC 


AAA 


GTC 


AGA CAA 




78 


79 


80 


81 


82 


83 


84 85 




gga 


cct 


aca 


cct gtc 


aac 


ata att 


gg 


GGA 


CCT 


ACA 


CCG 


GTC 


AAC 


ATA ATT 


GG 


GGA 


CCT 


ACA 


CCT 


GCC 


AAT 


ATA ATT 


GG 


GGA 


CCT 


ACG 


CCC 


TTC 


AAC 


ATA ATT 


GG 


GGA 


CCG 


ACA 


CCT 


GTC 


ACC 


ATA ATT 


GG 


GGA 


CCT 


ATA 


CCT 


GTC 


AAC 


ATA ATT 


GG 



87 88 89 90 91 92 93 94 
a aga aat ctg ttg act cag att ggc 

A AAA AAT CTG ATG ACT CAG ATT GGC 
A AGA ACT CTG TTG ACT CAG CTT GGA 
A AGA AAT ATG ATG ACC CAG CTT GGC 
A AGA AAT ATA ATG ACT CAG CTT GGA 
A AGA AAT CTG CTG ACT CAG ATT GGG 
A AGA AAT CTG TTG ACA CAG CTT GGC 
A AGA AAT ATG TTG ACT CAG CTT GGT 
A AGA AAT TTG TTG ACT CAG ATT GGG 
A AGA AAT ATG TTG ACT CAG CTT GGT 
A AGA AAT ATG TTG ACT CAG CTT GGA 
A AGA AAT CTG TTG ACT CAG CTT GGA 
A AGA AAC CTG TTG ACT CAA CTT GGT 



SEQ ID NO 481 
SEQ ID NO 4 82 
SEQ ID NO 483 

codon position 
consensus sequence 

SEQ ID NO 4 84 
SEQ ID NO 485 
SEQ ID NO 486 
SEQ ID NO 487 
SEQ ID NO 488 

codon position 
consensus sequence 

SEQ ID NO 489 
SEQ ID NO 490 
SEQ ID NO 491 
SEQ ID NO 4 92 
SEQ ID NO 4 93 
SEQ ID NO 4 94 
SEQ ID NO 4 95 
SEQ ID NO 4 96 
SEQ ID NO 4 97 
SEQ ID NO 498 
SEQ ID NO 499 
SEQ ID NO 500 



The present invention thus also relates to these novel sequences, or a fragment thereof, wherein said 
fragment consists of at least 10, preferably 15, even more preferably 20 contiguous nucleotides and 
contains at least one polymorphic nucleotide. It is furthermore to be understood that these new 
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polymorphic nucleotides may also be expected to arise in another sequence context than in the mentioned 
sequences. For instance a G at the third position of codon 55 is shown in SEQ ID N° 478 in 
combination with a T at the third position of codon 54, but a G at the third position of codon 55 may also 
be expected to occur in the context of a wild type sequence. It is also to be understood that the above 
5 mentioned specifications apply to the complement of the said target sequences as well. This applies also 
to Figure 1. 

According to a preferred embodiment the present invention relates to a method as indicated 
above, further characterized in that said probes are capable of simultaneously hybridizing to their 
respective target regions under appropriate hybridization and wash conditions allowing the detection of 
10 the hybrids formed. 

According to a preferred embodiment, step c is performed using a set of at least 2, preferably at 
least 3, more preferably at least 4 and most preferably at least 5 probes meticulously designed as such 
that they show the desired hybridization results. In general this method may be used for any purpose that 
relies on the presence or absence of mutations that can be detected by this method, e.g. for genotyping. 
15 The probes of table 1 have been optimized to give specific hybridization results when used in a LiPA 
assay (see below), as described in examples 2 and 3. These probes have thus also been optimized to 
simultaneously hybridize to their respective target regions under the same hybridization and wash 
conditions allowing the detection of hybrids. The sets of probes for each of the codons 30, 46/48, 50, 
54 and 82/84 have been tested experimentally as described in examples 2 and 3. The reactivity of the 
20 sets shown in table 1 with 856 serum samples from various geographic origins was evaluated. It was 
found that the sets of probes for codons 30, 46/48, 50, 54 and 82/84 reacted with 98.9%, 99.6%, 98.5%, 
99.2%, 95.4% and 97.2% of the test samples, respectively. The present invention thus also relates to the 
sets of probes for codons 30, 46/48, 50, 54, 82/84 and 90, shown in table 1 and table 7. 

According to another even more preferred embodiment, the present invention relates to a method 
25 as defined above, further characterized in that: 

step b) comprises amplifying a fragment of the protease gene with at least one 5'-primer 
specifically hybridizing to a target sequence located bet\yeen nucleotide position 210 and 
nucleotide position 260 (codon 87), more preferably between nucleotide position 220 and 
nucleotide position 260 (codon 87), more preferably between nucleotide position 230 and 
30 nucleotide position 260 (codon 87), even more preferably at nucleotide position 241 to 
nucleotide position 260 (codon 87) in combination with at least one suitable 3'- 
primer, and 

step c) comprises hybridizing the polynucleic acids of step a) or b) with at least one of the probes 
specifically hybridizing to a target sequence comprising codon 90. 
35 According to another even more preferred embodiment, the present invention relates to a method 

as defined above, further characterized in that: 
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step b) comprises amplifying a fragment of the protease gene with at least one 3 '-primer 
specifically hybridizing to a target sequence located between nucleotide position 253 (codon 85) 
and nucleotide positions 300, more preferably between nucleotide position 253 (codon 85) 
and nucleotide positions 290, more preferably between nucleotide position 253 (codon 85) 
and nucleotide positions 280, even more preferably at nucleotide position 253 (codon 85) 
to nucleotide position 273 (codon 91), in combination with at least one suitable 5'-primer, and 
step c) comprises hybridizing the polynucleic acids of step a) or b) with at least one of the probes 
specifically hybridizing to a target sequence comprising any of codons 30, 46, 48, 50, 52, 54, 82 and 
84. 

It has been found, unexpectedly, that an amplified nucleic acid fragment comprising all of the above- 
mentioned codons, does not hybridize optimally to probes comprising codon 82, 84 or 90. On the other 
hand, a shorter fragment, for instance the fragment which is amplified by use of the primers Prot41bio 
and Prot6bio with respectively seq id no 5 and seq id no 4, hybridizes better to probes comprising codon 
90. Better hybridization is also obtained when the fragment is amplified with primer Prot41bio in 
combination with primers Prot6abio, Prot6bbio, Prot6cbio and Prot6dbio The present invention thus also 
relates to a method as defined above, further characterized in that the 5 f -primer is seq id no 5 and at least 
one 3' primer is chosen from seq id no 4, seq id no506, seq id no 507, seq id no 508, and seq id no 509. 
Likewise, another shorter fragment, for instance the fragment which is amplified by use of the primers 
Prot2bio and Prot3 lbio with respectively seq id no 3 and seq id no 6 , was found to hybridize better to 
probes comprising codon 82 and/or 84. Hence the present invention also relates to a method as defined 
above, further characterized in that the 5'-primer is seq id no 5 and at least one 3'-primer is chosen from 
seq id no 4, seq id no506, seq id no 507, seq id no 508, and seq id no 509.. 

New sets of amplification primers as mentioned in example 1 were selected. The present invention thus 
also relates to primers: prot 16 (SEQ ID NO 501), prot 5 (SEQ ID NO 5), prot2a bio (SEQ ID NO 503), 
prot2b bio (SEQ ID NO 504), prot3 1 bio (SEQ ID NO 6), prot41-bio (SEQ ID NO 505), prot6a (SEQ 
ID NO 506), prot6b (SEQ ID NO 507), prot6c (SEQ ID NO 508) and prot6d (SEQ ID NO 509). A 
number of these primers are chemically modified (biotinylated), others are not. The present invention 
relates to any of the primers mentioned, primers containing unmodified nucleotides, or primers 
containing modified nucleotides. 

Different techniques can be applied to perform the sequence-specific hybridization methods of the 
present invention. These techniques may comprise immobilizing the amplified HIV polynucleic acids 
on a solid support and performing hybridization with labeled oligonucleotide probes. HIV polynucleic 
acids may also be immobilized on a solid support without prior amplification and subjected to 
hybridization. Alternatively, the probes may be immobilized on a solid support and hybridization may 
be performed with labeled HIV polynucleic acids, preferably after amplification. This technique is called 
reverse hybridization. A convenient reverse hybridization technique is the line probe assay (LiPA). This 
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assay uses oligonucleotide probes immobilized as parallel lines on a solid support strip (Stuyver et al., 
1993). It is to be understood that any other technique based on the above-mentioned methods is also 
covered by the present invention. 

According to another preferred embodiment, the present invention relates to any of the probes 
mentioned above and/or to any of the primers mentioned above, with said primers and probes being 
designed for use in a method for determining the susceptibility to antiviral drugs of HIV viruses in a 
sample. According to an even more preferred embodiment, the present invention relates to the probes 
with seq id no 7 to seq id no 477 and seq id no510 to seq id no 519, more preferably to the seq id no 
mentioned in Table 1 and Table 7, and to the primers with seq id no 3, 4, 5 and 6, 501, 502, 503, 504, 
505, 506, 507, 508 and 509. The skilled man will recognize that addition or deletion of one or more 
nucleotides at their extremities may adapt the said probes and primers. Such adaptations may be required 
if the conditions of amplification or hybridization are changed, or if the amplified material is RNA 
instead of DNA, as is the case in the NASBA system. 

According to another preferred embodiment, the present invention relates to a diagnostic kit 
enabling a method for determining the susceptibility to antiviral drugs of HIV viruses in a biological 
sample, with said kit comprising: 

a) when appropriate, a means for releasing, isolating or concentrating the polynucleic acids present 
in said sample; 

b) when appropriate, at least one of the primers of any of claims 4 to 6; 

c) at least one of the probes of any of claims 1 to 3, possibly fixed to a solid support; 

d) a hybridization buffer, or components necessary for producing said buffer; 

e) a wash solution, or components necessary for producing said solution; 

f) when appropriate, a means for detecting the hybrids resulting from the preceding hybridization; 
h) when appropriate, a means for attaching said probe to a solid support. 

DEFINITIONS 

The following definitions serve to illustrate the terms and expressions used in the present invention. 

The term "antiviral drugs" refers particularly to any antiviral protease inhibitor. Examples of 
such antiviral drugs and the mutation they may cause in the HIV protease gene are disclosed in Schinazi 
et al., 1997. The contents of the latter two documents particularly are to be considered as forming part 
of the present invention. The most important antiviral drugs focussed at in the present invention are 
disclosed in Tables 1 to 2. 

The target material in the samples to be analyzed may either be DNA or RNA, e.g.: genomic 
DNA, messenger RNA, viral RNA or amplified versions thereof. These molecules are also termed 
polynucleic acids. 

It is possible to use genomic DNA or RNA molecules from HIV samples in the methods 
according to the present invention. 
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Well-known extraction and purification procedures are available for the isolation of RNA or 
DNA from a sample (fi. in Maniatis et al., Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold 
Spring Harbour Laboratory Press ( 1 989)). 

The term "probe" refers to single stranded sequence-specific oligonucleotides, which have a 
5 sequence, which is complementary to the target sequence to be detected. 

The term "target sequence" as referred to in the present invention describes the wild type 
nucleotide sequence, or the sequence comprising one or more polymorphic nucleotides of the protease 
gene to be specifically detected by a probe according to the present invention. This nucleotide sequence 
may encompass one or several nucleotide changes. Target sequences may refer to single nucleotide 
10 positions, codon positions, nucleotides encoding amino acids or to sequences spanning any of the 
foregoing nucleotide positions. In the present invention said target sequence often includes one or two 
variable nucleotide positions. 

The term "polymorphic nucleotide" indicates a nucleotide in the protease gene of a particular 
HIV virus that is different from the nucleotide at the corresponding position in at least one other HIV 
15 virus. The polymorphic nucleotide may or may not give rise to resistance to an antiviral drug. 

It is to be understood that the complement of said target sequence is also a suitable target sequence in 
some cases. The target sequences as defined in the present invention provide sequences which should 
be complementary to the central part of the probe which is designed to hybridize specifically to said 
target region. 

20 The term "complementary" as used herein means that the sequence of the single stranded probe 

is exactly the (inverse) complement of the sequence of the single-stranded target, with the target being 
defined as the sequence where the mutation to be detected is located. 

"Specific hybridization" of a probe to a target sequence of the HIV polynucleic acids means that 
said probe forms a duplex with part of this region or with the entire region under the experimental 

25 conditions used, and that under those conditions said probe does not form a duplex with other regions 
of the polynucleic acids present in the sample to be analyzed. 

Since the current application requires the detection of single basepair mismatches, very stringent 
conditions for hybridization are required, allowing in principle only hybridization of exactly 
complementary sequences. However, variations are possible in the length of the probes (see below), and 

30 it should be noted that, since the central part of the probe is essential for its hybridization characteristics, 
possible deviations of the probe sequence versus the target sequence may be allowable towards head and 
tail of the probe, when longer probe sequences are used. These variations, which may be conceived from 
the common knowledge in the art, should however always be evaluated experimentally, in order to check 
if they result in equivalent hybridization characteristics than the exactly complementary probes. 

35 Preferably, the probes of the invention are about 5 to 50 nucleotides long, more preferably from 

about 10 to 25 nucleotides. Particularly preferred lengths of probes include 10, 1 1, 12, 13, 14, 15, 16, 17, 
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18, 19, 20, 21, 22, 23, 24 or 25 nucleotides. The nucleotides as used in the present invention may be 
ribonucleotides, deoxyribonucleotides and modified nucleotides such as inosine or nucleotides containing 
modified groups, which do not essentially alter their hybridization characteristics. 

Probe sequences are represented throughout the specification as single stranded DNA 
5 oligonucleotides from the 5' to the 3' end. It is obvious to the man skilled in the art that any of the below- 
specified probes can be used as such, or in their complementary form, or in their RNA form (wherein 
U replaces T). 

The probes according to the invention can be prepared by cloning of recombinant plasmids 
containing inserts including the corresponding nucleotide sequences, if need be by cleaving the latter out 
1 0 from the cloned plasmids upon using the adequate nucleases and recovering them, e.g. by fractionation 
according to molecular weight. The probes according to the present invention can also be synthesized 
chemically, for instance by the conventional phospho-triester method. 

The term "solid support" can refer to any substrate to which an oligonucleotide probe can be 
coupled, provided that it retains its hybridization characteristics and provided that the background level 
15 of hybridization remains low. Usually the solid substrate will be a microtiter plate, a membrane (e.g. 
nylon or nitrocellulose) or a microsphere (bead) or a chip. Prior to application to the membrane or 
fixation it may be convenient to modify the nucleic acid probe in order to facilitate fixation or improve 
the hybridization efficiency. Such modifications may encompass homopolymer tailing, coupling with 
different reactive groups such as aliphatic groups, NH 2 groups, SH groups, carboxylic groups, or 
20 coupling with biotih, haptens or proteins. 

The term "labeled" refers to the use of labeled nucleic acids. Labeling may be carried out by the 
use of labeled nucleotides incorporated during the polymerase step of the amplification such as illustrated 
by Saiki et al. (1988) or Bej et al. (1990) or labeled primers, or by any other method known to the person 
skilled in the art. The nature of the label may be isotopic ( 32 P, 35 S, etc.) or non-isotopic (biotin, 
25 digoxigenin, etc.). 

The term "primer" refers to a single stranded oligonucleotide sequence capable of acting as a 
point of initiation for synthesis of a primer extension product, which is complementary to the nucleic 
acid strand to be copied. The length and the sequence of the primer must be such that they allow to prime 
the synthesis of the extension products. Preferably the primer is about 5-50 nucleotides long. Specific 
30 length and sequence will depend on the complexity of the required DNA or RNA targets, as well as on 
the conditions of primer use such as temperature and ionic strength. 

The term "primer pair" refers to a set of primers comprising at least one 5* primer and one 3' 
primer. The primer pair may consist of more than two primers, the complexity of the number of primers 
will depend on the hybridization conditions, variability of the sequences in the regions to be amplified 
35 and the target sequences to be detected. 

The fact that amplification primers do not have to match exactly with the corresponding template 
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sequence to warrant proper amplification is amply documented in the literature (Kwok et al., 1990). 

The amplification method used can be either polymerase chain reaction (PCR; Saiki et al., 1988), 
ligase chain reaction (LCR; Landgren et al., 1988; Wu & Wallace, 1989; Barany, 1991), nucleic acid 
sequence-based amplification (NASBA; Guatelli et al., 1990; Compton, 1991), transcription-based 
5 amplification system (TAS; Kwoh et al., 1989), strand displacement amplification (SDA; Duck, 1990) 
or amplification by means of QB replicase (Lomeli et al., 1989) or any other suitable method to amplify 
nucleic acid molecules known in the art. 

The oligonucleotides used as primers or probes may also comprise nucleotide analogues such 
as phosphorothiates (Matsukura et al., 1987)1, alkylphosphorothiates (Miller et al., 1979) or peptide 
1 0 nucleic acids (Nielsen et al., 1 991 ; Nielsen et al., 1 993) or may contain intercalating agents (Asseline et 
al, 1984). 

As most other variations or modifications introduced into the original DNA sequences of the 
invention these variations will necessitate adaptations with respect to the conditions under which the 
oligonucleotide should be used to obtain the required specificity and sensitivity. However the eventual 
1 5 results of hybridization will be essentially the same as those obtained with the unmodified 
oligonucleotides. 

The introduction of these modifications may be advantageous in order to positively influence 
characteristics such as hybridization kinetics, reversibility of the hybrid-formation, biological stability 
of the oligonucleotide molecules, etc. 

20 The "sample" may be any biological material taken either directly from the infected human being 

(or animal), or after culturing (enrichment). Biological material may be e.g. expectorations of any kind, 
broncheolavages, blood, skin tissue, biopsies, sperm, lymphocyte blood culture material, colonies, liquid 
cultures, fecal samples, urine etc. 

The sets of probes of the present invention will include at least 2, 3, 4, 5, 6, 7, 8, 9, 10, 1 1, 12, 

25 13, 14, 15, 16, 17, 1 8, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more probes. Said probes may be 
applied in two or more distinct and known positions on a solid substrate. Often it is preferable to apply 
two or more probes together in one and the same position of said solid support. 

For designing probes with desired characteristics, the following useful guidelines known to the 
person skilled in the art can be applied. 

30 Because the extent and specificity of hybridization reactions such as those described herein are 

affected by a number of factors, manipulation of one or more of those factors will determine the exact 
sensitivity and specificity of a particular probe, whether perfectly complementary to its target or not. The 
importance and effect of various assay conditions, explained further herein, are known to those skilled 
in the art. 

35 The stability of the [probe : target] nucleic acid hybrid should be chosen to be compatible with 

the assay conditions. This may be accomplished by avoiding long AT-rich sequences, by terminating the 
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hybrids with G;C base pairs, and by designing the probe with an appropriate Tm. The beginning and end 
points of the probe should be chosen so that the length and %GC result in a Tm about 2-10°C higher than 
the temperature at which the final assay will be performed. The base composition of the probe is 
significant because G-C base pairs exhibit greater thermal stability as compared to A-T base pairs due 
to additional hydrogen bonding. Thus, hybridization involving complementary nucleic acids of higher 
G-C content will be stable at higher temperatures. 

Conditions such as ionic strength and incubation temperature under which a probe will be used 
should also be taken into account when designing a probe. It is known that hybridization will increase 
as the ionic strength of the reaction mixture increases, and that the thermal stability of the hybrids will 
increase with increasing ionic strength. On the other hand, chemical reagents, such as formamide, urea, 
DMSO and alcohols, which disrupt hydrogen bonds, will increase the stringency of hybridization. 
Destabilization of the hydrogen bonds by such reagents can greatly reduce the Tm. In general, optimal 
hybridization for synthetic oligonucleotide probes of about 10-50 bases in length occurs approximately 
5 C C below the melting temperature for a given duplex. Incubation at temperatures below the optimum 
may allow mismatched base sequences to hybridize and can therefore result in reduced specificity. 

It is desirable to have probes, which hybridize only under conditions of high stringency. Under 
high stringency conditions only highly complementary nucleic acid hybrids will form; hybrids without 
a sufficient degree of complementarity will not form. Accordingly, the stringency of the assay conditions 
determines the amount of complementarity needed between two nucleic acid strands forming a hybrid. 
The degree of stringency is chosen such as to maximize the difference in stability between the hybrid 
formed with the target and the nontarget nucleic acid. In the present case, single base pair changes need 
to be detected, which requires conditions of very high stringency. 

The length of the target nucleic acid sequence and, accordingly, the length of the probe sequence 
can also be important. In some cases, there may be several sequences from a particular region, varying 
in location and length, which will yield probes with the desired hybridization characteristics. In other 
cases, one sequence may be significantly better than another that differs merely by a single base. While 
it is possible for nucleic acids that are not perfectly complementary to hybridize, the longest stretch of 
perfectly complementary base sequence will normally primarily determine hybrid stability. While 
oligonucleotide probes of different lengths and base composition may be used, preferred oligonucleotide 
probes of this invention are between about 5 to 50 (more particularly 10-25) bases in length and have a 
sufficient stretch in the sequence which is perfectly complementary to the target nucleic acid sequence. 

Regions in the target DNA or RNA, which are known to form strong internal structures 
inhibitory to hybridization, are less preferred. Likewise, probes with extensive self-complementarity 
should be avoided. As explained above, hybridization is the association of two single strands of 
complementary nucleic acids to form a hydrogen bonded double strand. It is implicit that if one of the 
two strands is wholly or partially involved in a hybrid that it will be less able to participate in formation 
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of a new hybrid. There can be intramolecular and intermolecular hybrids formed within the molecules 
of one type of probe if there is sufficient self complementarity. Such structures can be avoided through 
careful probe design. By designing a probe so that a substantial portion of the sequence of interest is 
single stranded, the rate and extent of hybridization may be greatly increased. Computer programs are 
available to search for this type of interaction. However, in certain instances, it may not be possible to 
avoid this type of interaction. 

Standard hybridization and wash conditions are disclosed in the Materials & Methods section 
of the Examples. Other conditions are for instance 3X SSC (Sodium Saline Citrate), 20% deionized FA 
(Formamide) at 50°C 

Other solutions (SSPE (Sodium saline phosphate EDTA), TMAC1 (Tetramethyl ammonium 
Chloride), etc.) and temperatures can also be used provided that the specificity and sensitivity of the 
probes is maintained. If need be, slight modifications of the probes in length or in sequence have to be 
carried out to maintain the specificity and sensitivity required under the given circumstances. 

Primers may be labeled with a label of choice (e.g. biotin). Different primer-based target 
amplification systems may be used, and preferably PCR-amplification, as set out in the examples. Single- 
round or nested PCR may be used. 

The term "hybridization buffer" means a buffer enabling a hybridization reaction to occur 
between the probes and the polynucleic acids present in the sample, or the amplified products, under the 
appropriate stringency conditions. 

The term "wash solution" means a solution enabling washing of the hybrids formed under the 
appropriate stringency conditions. 

The following examples only serve to illustrate the present invention. These examples are in no way 
intended to limit the scope of the present invention. 

FIGURE AND TABLE LEGENDS 

Figure 1 : Natural and drug selected variability in the vicinity of codons 30, 46, 48, 50, 54, 82, 
84, and 90 of the HIV-1 protease gene. The most frequently observed wild-type sequence is shown in 
the top line. Naturally occurring variations are indicated below and occur independently from each other. 
Drug-selected variants are indicated in bold 

Figure 2 A: Reactivities of the selected probes for codon 30 immobilized on LiPA strips with reference 
material. The information in the boxed surface is not relevant for the discussion of probes for condon 30 
The position of each selected probe on the membrane strip is shown at the left of each panel. The 
sequence of the relevant part of the selected probes is shown at the left and is given in Table 1. Each strip 
is incubated with a biotinylated PCR fragment from the reference panel. The reference panel accession 
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numbers are indicated in Table 1 . For several probes multiple reference panel possibilities are available, 
but only one relevant accession number given each time. *: False positive reactivities. At the bottom the 
strips, the amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 

Figure 2 B: Reactivities of the selected probes for codons 46 and 48 immobilized on LiPA strips with 
reference material. The information in the boxed surface is not relevant for the discussion of probes for 
condoris 46 and 48 The position of each selected probe on the membrane strip is shown at the left of each 
panel The sequence of the relevant part of the selected probes is given in Table 1. Each strip is incubated 
with a biotinylated PCR fragment from the reference panel. The reference panel accession numbers are 
indicated in Table 1 . For several probes multiple reference panel possibilities are available, but only one 
relevant accession number given each time. *: False positive reactivities. On top of the strips, the amino 
acids at the relevant codon, as derived from the probe reactivity, is indicated. 

Figure 2 C: Reactivities of the selected probes for codon 50 immobilized on LiPA strips with reference 
material. The information in the boxed surface is not relevant for the discussion of probes for condon 50. 
The position of each selected probe on the membrane strip is shown at the left of each panel. The 
sequence of the relevant part of the selected probes is given in Table 1. Each strip is incubated with a 
biotinylated PCR fragment from the reference panel. The reference panel accession numbers are 
indicated in Table 1 . For several probes multiple reference panel possibilities are available, but only one 
relevant accession number given each time. *: False positive reactivities. At the bottom of the strips, the 
amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 

Figure 2 D: Reactivities of the selected probes for codon 54 immobilized on LiPA strips with reference 
material. The information in the boxed surface is not relevant for the discussion of probes for condon 54. 
The position of each selected probe on the membrane strip is shown at the left of each panel. The 
sequence of the relevant part of the selected probes is given in Table 1. Each strip is incubated with a 
biotinylated PCR fragment from the reference panel. The reference panel accession numbers are 
indicated in Table 1 . For several probes multiple reference panel possibilities are available, but only one 
relevant accession number given each time. *: False positive reactivities. At the bottom of the strips, the 
amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 

Figure 2 E.:Reactivities of the selected probes for codons 82 and 84 immobilized on LiPA strips with 
reference material. The information in the boxed surface is not relevant for the discussion of probes for 
condons 82 and 84. The position of each selected probe on the membrane strip is shown at the left of 
each panel. The sequence of the relevant part of the selected probes is given in Table 1. Each strip is 
incubated with a biotinylated PCR fragment from the reference panel. The reference panel accession 
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numbers are indicated in Table 1 . For several probes multiple reference panel possibilities are available, 
but only one relevant accession number given each time. *: False positive reactivities. At the bottom of 
the strips, the amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 

5 Figure 2 F: Reactivities of the selected probes for codon 90 immobilized on LiPA strips with reference 
material. The information in the boxed surface is not relevant for the discussion of probes for condon 90. 
The position of each selected probe on the membrane strip is shown at the left of each panel. The 
sequence of the relevant part of the selected probes is given in Table 1. Each strip is incubated with a 
biotinylated PGR fragment from the reference panel. The reference panel accession numbers are 

1 0 indicated in Table 1 . For several probes multiple reference panel possibilities are available, but only one 
relevant accession number given each time. *: False positive reactivities. At the bottom of the strips, the 
amino acids at the relevant codon, as derived from the probe reactivity, is indicated. 

Figure 3: Sequence and position of the HIV-1 protease amplification primers. To obtain the reactivity 
1 5 with probes selected to determine the susceptibility to antiviral drugs involving codons 30, 46, 48, 50, 
54, 82, and 84, nested amplification primers prot2bio(5* primer) and Prot31bio (3* primer) were designed. 
To obtain the reactivity with probes selected to determine the susceptibility to antiviral drugs involving 
codon 90, nested amplification primers Prot41bio (5 1 primer) and Prot6bio (3' primer) were designed. 

20 Figure 4 A: Phylogenetic analysis on 3 12 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codon 30 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1 . 

25 

Figure 4 B: Phylogenetic analysis on 3 12 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codons 46/48 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
30 probes is given in Table 1 . 

Figure 4 C: Phylogenetic analysis on 3 12 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codon 50 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
35 indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1 . 
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Figure 4 D: Phylogenetic analysis on 312 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codon 54 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
5 indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1. 

Figure 4 E: Phylogenetic analysis oh 312 protease sequences allowed to separate genotype B strains from 
non-B strains. Reactivities of the selected probes for codons 82/84 immobilized on LiPA strips with a 
1 0 biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1 . 

Figure 4 F: Phylogenetic analysis on 312 protease sequences allowed to separate genotype B strains from 
15 non-B strains. Reactivities of the selected probes for codon 90 immobilized on LiPA strips with a 
biotinylated PCR fragment of genotype B strains and non-B strains is shown, the exact percentages are 
indicated in table 5. The probes are indicated at the bottom. The sequence of the relevant part of the 
probes is given in Table 1 . 

20 Figure 5 A: Geographical origin of 856 samples and reactivities with the different probes at codon 
position 30. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

Figure 5 B: Geographical origin of 856 samples and reactivities with the different probes at codon 
positions 46/48. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

25 

Figure 5 C: Geographical origin of 856 samples and reactivities with the different probes at codon 
position 50. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

Figure 5 D: Geographical origin of 856 samples and reactivities with the different probes at codon 
30 position 54. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

Figure 5 E: Geographical origin of 856 samples and reactivities with the different probes at codon 
positions 82/84. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 

35 Figure 5 F: Geographical origin of 856 samples and reactivities with the different probes at codon 
position 90. The exact percentages are indicated in table 6. The probes are indicated at the bottom. 
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Table 1: HTV-1 protease wild-type and drug-selected mutation probes with their corresponding 
sequences as applied on the HIV-1 protease LiPA strip. The most frequently observed wild-type 
sequence is shown at the top line. Probe names corresponding to the selected motifs are indicated in the 
5 left column, the relevant part of each probe applied on the strip is shown under the consensus sequence. 

Table 2: Protease Inhibitors. 

Table 3: HIV-1 protease wild-type and drug-selected mutation probes with their corresponding 
1 0 sequences as synthesized, immobilized and tested on LiPA strips. The most frequently observed wild- 
type sequence is shown at the top line. Probe names corresponding to the selected motifs are indicated 
in the left column, the relevant part of each probe applied on the strip is shown under the consensus 
sequence. The probes retained are indicated in table 1. 

15 Table 4: Polymorphic nucleotide sequences. 

Table 5: % Reactivities of the HTV-1 protease wild-type and drug- selected mutation probes applied on 
the HIV-1 protease LiPA strip with genotype B strains and non-B strains. 

20 Table 6: % Reactivities of the HTV-1 protease wild-type and drug-selected mutation probes applied on 
the HIV-1 protease LiPA strip with samples of different geographical origin. 

Table 7: HIV-1 protease wild-type and drug-selected mutation probes with their corresponding sequences 
as applied on the HTV-1 protease LiPA strip. The most frequently observed wild-type sequence is shown 
25 at the top line. Probe names corresponding to the selected motifs are indicated in the left column, the 
relevant part of each probe applied on the strip is shown under the consensus sequence. 

EXAMPLES 

30 Example 1: 

Selection of the plasma samples, PCR amplification and cloning of the PCR products . 

Plasma samples (n=557) were taken from HTV type-1 infected patients and stored at -20°C until 
use. Plasma samples were obtained from naive and drug-treated patients. The drugs involved ritonavir, 
35 indinavir and saquinavir. The serum samples were collected from patients residing in Europe (Belgium, 
Luxembourg, France, Spain and UK), USA and Brazil. 
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HIV RNA was prepared from these samples using the guanidinium-phenol procedure. Fifty ^1 
plasma was mixed with 150 y\ Trizol®LS Reagent (Life Technologies, Gent, Belgium) at room 
temperature (volume ratio: lunit sample/ 3 units Trizol ). Lysis and denaturation occurred by carefully 
pipetting up and down several times, followed by an incubation step at room temperature for at least 5 
5 minutes. Fourthy p.1 CHC1 3 was added and the mixture was shaken vigorously by hand for at least 15 
seconds, and incubated for 15 minutes at room temperature. The samples were centrifiiged at maximum 
12,000g for 15 minutes at 4°C, and the colorless aqueous phase was collected and mixed with 100 fxl 
isopropanol. To visualize the minute amounts of viral RNA, 20 fxl of 1 jxg/jal Dextran T500 (Pharmacia) 
was added, mixed and left at room temperature for 10 minutes. Following centrifugation at max. 12,000g 

10 for 1 0 minutes at 4°C and aspiration of the supernatant, the RNA pellet was washed with 200 |il ethanol, 
mixed by vortexing and collected by centrifugation at 7,500g for 5 minutes at 4°C. Finally the RNA 
pellet was briefly air-dried and stored at -20°C. Alternatively, the High Pure Viral Nucleic Acid Kit 
(Boehringer Mannheim ) was used to extract RNA from the samples 

For cDNA synthesis and PCR amplification, the RNA pellet was dissolved in 15 \i\ random 

1 5 primers (20 ng/^il, pdN 6 , Pharmacia), prepared in DEPC-treated or HPLC grade water. After denaturation 
at 70°C for 10 minutes, 5 ^1 cDNA mix was added, composed of 4 ^1 5x AMV-RT buffer (250mM 
Tris.HCl pH 8.5, lOOmM KC1, 30mM MgCl 2 , 25 mM DTT), 0.4 jiL 25mM dXTPs, 0.2 or 25U 
Ribonuclease Inhibitor (HPRI, Amersham), and 0.3 |al or 8U AMV-RT (Stratagene). cDNA synthesis 
occurred during the 90 minutes incubation at 42°C The HTV -1 protease gene was than amplified using 

20 the following reaction mixture: 5 jllI cDNA, 4.5 fxl lOx Taq buffer, 0.3 jil 25 mM dXTPs, 1 \i\ (10 pmol) 
of each PCR primer, 38 ^1 H 2 0, and 0.2 \il (1 U) Taq. . Alternatively, the Titon One Tube RT-PCR 
system (Boehringer Mannheim) was used to perform RT-PCR. 

Codon positions involving resistance to saquinavir, ritonavir, indinavir, nelfinavir and VX-478 have been 
described (Shinazi et at) and PCR amplification primers were chosen outside these regions. The primer 

25 design was based on HIV-1 published sequences (mainly genotype B clade) (Myers et al) and located 
in regions that showed a high degree of nucleotide conservation between the different HIV-1 clades. The 
final amplified region covered the HIV-1 protease gene from codon 9 to codon 99. The primers for 
amplification had the following sequence: outer sense primer Prl6: 5* bio- 
CAGAGCCAACAGCCCCACCAG3' (SEQ ID NO 1); nested sense primer Prot 2 bio: 5* CCT CAR ATC 

30 ACT CTT TGG CAA CG 3' (SEQ ID NO 3) ; nested antisense primer Prot 6 bio: 3* TAA TCR GGA 
TAA CTY TGA CAT GGT C 5* (SEQ ID NO 4); and outer antisense primer RT12: 5' 
bioATCAGGATGGAGTTCATAACCCATCCA3 ! (SEQ ID NO 2). Annealing occurred at 57°C, 
extension at 72°C and denaturation at 94°C. Each step of the cycle took 1 minute, the outer PCR 
contained 40 cycles, the nested round 35. Nested round PCR products were analyzed on agarose gel and 

35 only clearly visible amplification products were used in the LiPA procedure. Quantification of viral 
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RNA was obtained with the HIV Monitor™test (Roche, Brussels, Belgium). Later on, new sets of 
primers for amplification were selected. For the amplification of HIV protease codon 30-84: outer sense 
primer protl6: 5'-CAGAGCCAACAGCCCCACCAG-3' (SEQ ID NO 501), outer antisense primer 
prot5: 5 , -TTTTCTrCTGTCAATGGCCATTG'nT-3 » (SEQ ID NO 502) were used Annealing occurred 
5 at 50°C, extension at 68°C and denaturation at 94°C for 35 cycles for the outer PCR. For the nested PCR 
annealing occurred at 45°C, denaturation at 94°C and extension at 92°C with primers: nested sense 
primers prot2a-bio: S'-bio-CCTCAAATCACTCTTTGGCAACG-S' (SEQ ID NO 503)and prot2b-bio: 
5^bio-CCTCAGATCAC^CTTTGGCAACG-3 , (SEQ ID NO 504), and nested antisense primer prom- 
bio: 5 1 -bio- AGTCAACAGATTTCTTCCAAT-3 ' (SEQ ID NO 6). For the amplification of HIV protease 
1 0 codon 90, the outer PCR was as specified for HTV protease codon 30-84. For the nested PCR, nested 
sense primer prot41-bio: 5 ' -bio-CCTGTC AAC ATAATTGC AAG-3 9 (SEQ ID NO 505) and nested 
antisense primers prot6a: 5 '-bio-CrGGTACAGTrTC AATAGGGCTAAT-3 * (SEQ ID NO 506), prot6b: 
5 ' -bio-CTGGTAC AGTTTCAATAGG ACTAAT-3 ' (SEQ ID NO 507), prot6c: 5*-bio- 
CTGGTACAGTCTCAATAGG ACTAAT-3 ' (SEQ ID NO 508), prot6d: 5'-bio- 
15 CTGGTACAGTCTC AATAGGGCTAAT-3 ' (SEQ ID NO 509) were used. For the nested PCR the 
annealing temperature occurred at 45°C. Primers were tested on a plasmid, which contained an HTV 
fragment of 1301 bp ligated in a pGEM-T vector. The fragment contains protease, reverse transcriptase 
and the primer sites of first and second round PCR. By restriction with Sac / the plasmid is linearised. 
Selected PCR products were cloned into the pretreated EcoRV site of the pGEMT vector 
20 (Promega). Recombinant clones were selected after a-complementation and restriction fragment length 
analysis, and sequenced using standard sequencing techniques with plasmid primers and internal HTV 
protease primers. Sometimes biotinylated fragments were directly sequenced with a dye-terminator 
protocol (Applied Biosystems) using the amplification primers. Alternatively, nested PCR was carried 
out with analogs of the nested primers, in which the biotin group was replaced with the T7- and SP6- 
25 primer sequence, respectively. These amplicons were than sequenced with an SP6- and T7-dye-primer 
procedure. 



Example 2 : 
30 Selection of a reference panel 

Codon positions involving resistance to saquinavir, ritonavir, indinavir, nelfmavir and VX-478 
have been described (Shinazi et al 1997). It was the aim to clone in plasmids those viral protease genes 
that are covering the different genetic motifs at those important codon positions conferring resistance 
against the described protease inhibitors. 
35 After careful analysis of 312 protease gene sequences, obtained after direct sequencing of PCR 

fragments, a selection of 47 PCR fragments which covered the different target polymorphisms and 
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mutations were retained and cloned in plasmids using described cloning techniques. The selection of 
samples originated from naive or drug-treated European, Brazilian or US patients. These 47 
recombinant plasmids are used as a reference panel, a panel that was sequenced on both strands, and 
biotinylated PCR products from this panel were used to optimize probes for specificity and sensitivity. 
5 Although this panel of 47 samples is a representative selection of clones at this moment, it is 

important to mention here that this selection is an fact only a temporally picture of the variability of the 
virus, and a continuous update of this panel will be mandatory. This includes on ongoing screening for 
the new variants of the virus, and recombinant cloning of these new motifs. 

10 Probe selection and LiPA testing . 

To cover all the different genetic motifs in the reference panel, a total of 471 probes were 
designed (codon 30: 40 probes; codon 46/48: 72 probes; codon 50:55 probes; codon 54: 54 probes; codori 
82/84: 130 probes; codon 90: 120 probes).Table 3 shows the different probes that were selected for the 
different codon positions. 

15 It was the aim to adapt all probes to react specifically under the same hybridization and wash 

conditions by carefully considering the % (G+C), the probe length, the final concentration of the buffer 
components, and hybridization temperature (Stuyver et al., 1997). Therefore, probes were provided 
enzymatically with a poly-T-tail using the TdT (Pharmacia) in a standard reaction condition, and purified 
via precipitation. For a limited number of probes with 3' T-ending sequences, an additional G was 

20 incoiporated between the probe sequence and the poly-T-tail in order to limit the hybridizing part to the 
specific probe sequence and to exclude hybridization with the tail sequence. Probe pellets were dissolved 
in standard saline citrate (SSC) buffer and applied as horizontal parallel lines on a membrane strip. 
Control lines for amplification (probe 5* TAGGGGGAATTGGAGGTTTTAG 3 \ HIV protease aa 47 
to aa 54) and conjugate incubation (biotinylated DNA) were applied alongside.. Probes were 

25 immobilized onto membranes by baking, and the membranes were sliced into 4mm strips also called 
LiPA strips. 

Selection of the amplification primers and PCR amplification was as described in example 1. 
In order to select specific reacting probes out of the 471 candidate probes, LiPA tests were performed 
with biotinylated PCR fragments from the reference panel. To perform LiPA tests, equal amounts (10 

30 |il) of biotinylated amplification products and denaturation mixture (0.4 N NaOH/0.1% SDS) were 
mixed, followed by an incubation at room temperature for 5 minutes. Following this denaturation step, 
2 ml hybridization buffer (2xSSC, 0.1 % SDS, 50mM Tris pH7.5) was added together with a membrane 
strip and hybridization was carried out at 39°C for 30 min. Then, the hybridization mixture was replaced 
by stringent washing buffer (same composition as hybridization buffer), and stringent washing occurred 

35 first at room temperature for 5 minutes and than at 39°C for another 25 minutes. Buffers were than 
replaced to be suitable for the streptavidine alkaline phosphatase conjugate incubations. After 30 minutes 
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incubation at room temperature, conjugate was rinsed away and replaced by the substrate components 
for alkaline phosphatase, Nitro-Blue-Tetrazolium and 5-Bromo-4-Chloro-3-Indolyl Phosphate. After 30 
minutes incubation at room temperature, probes where hybridization occurred became visible because 
of the purple brown precipitate at these positions. 
5 After careful analysis of the 471 probes, the most specific and sensitive probes (n=46) were 

finally selected, covering the natural and drug-selected variability in the vicinity of aa. 30, 46, 48, 50, 
54, 82, 84, and 90. Figure 2 shows the reactivity of the finally selected probes with the reference panel. 

Example 3 : 
10 LiPA testing on clinical samples . 

A total of 856 samples were tested on this selection of 46 specific probes. The geographical 
origin of these samples is as follows: USA:359 ; France: 154; UK:36; Brazil 58; Spain 35; Belgium 199; 
Luxembourg: 15. 

From this population, a total of 144 samples were sequenced which allowed to separate the 

15 genotype B samples (94) from the non-B samples (50). After analysis of these genotyped samples on 
LiPA, the genotypic reactivity on the selected probes was scored. Figures 4A to 4F show these results 
for the different codon positions and for the genotype B versus non-B group. From these tables, it is clear 
that there is little difference in sequence usage for the different codon positions with respect to specific 
reactivities at the different probes. 

20 The total collection of 856 samples was then tested on the available 46 probes. After dissection 

of these reactivities over the different probes and different geographical origin, the picture looks as is 
presented in Figures 5 A to 5F. Again here, the majority of the sequences used at the different codon 
positions are restricted to some very abundant wild type motifs. It is important to mention here that the 
majority of these samples are taken from patients never treated with protease inhibitors, en therefore, the 

25 majority of the reactivities are found in wild type motifs. Nevertheless, it is clear from some codon 
positions that the variability at some codon positions in the mutant motif might be considerable, and 
again, a continuos update on heavily treated patients is mandatory. Another issue is the amount of double 
blank reactivities, which is in this approach reaching up to 5% in global; with some peak values for some 
countries for some codon positions: for example 13.8% for codon 82/85 in Brazil; and 18.1 % for codon 

30 90 in Belgium. 

The continuous update resulted in a further selection of probes. This later configuration of the 
strip is indicated in table 7. 
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Table 1 - Cont'd 



86 87 88 89 90 91 92 93 94 
GGA AGA AAT CTG TTG ACT CAG ATT GGT 



pc90w27 AAT CTG TTG ACT CA 

pc90w37 AAT CTG TTG ACT CAG ATG 

pc90w39 GA ACT CTG TTG ACT C 

pc90w50 AAT ATG TTG ACT CAG 

pc90w52 AAT TTG TTG ACT CAG 

pc90w69 GA AAC CTG TTG ACT 

pc90w73 TG TTG ACA CAG CTT G 

pc90w79 TG TTG ACC CAG ATT G 

pc90m43 A AAT CTG ATG ACT CA 

pc90m56 AAT ATG ATG ACC CAG 
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Table 2 
Protease Inhibitors 



Compound 
Protease Inhibitors 
A-77003 



P9941 

Ro 31-8959 
(saquinavir) 



Amino acid change 



R8Q 
R8K 
V32I 

M46I 

M46L 
M46F 
M46V 
G48V 
A71V 
V82I 

V82A 

L63P 
A71T 
A71V 
G73S 

V82A 

V82F 

V82T 

I84V 

L90M 

V82A 

LI 01 
G48V 



Codon change 



CGAtoCAA 
CGA to AAA 
GTAtoATA 

ATGtoATA 

ATGtoTTC 
ATGtoTTC 
ATGtoGTG 
GGGtoGTG 
GCT to GTT 
GTC to ATC 

GTCtoGCC 

CTC to CCC 
GCT to ACT 
GCT to GTT 
GGT to GCT 

GTC to GCC 
GTC to TTC 
GTCtoACC 
ATAtoGTA 
TTG to ATG 

GTCtoGCC 

CTC to ATC 
GGGtoGTG 



I54V ATC to GTC 

I54V ATA to GTA 

G73S GGT to AGT 

V82A GTCtoGCC 

I84V ATA to GTA 

L90M TTG to ATG 



RPI-312 



I84V 



ATAtoGTA 
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Table 2 - Cont'd-1 



SC-52151 


L24V 


TTA to GTA 
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Table 2 - Cont'd-2 



M36I ATGtoATA 

M46I ATGtoATA 

I54L ATC to ? 

I54V ATC to GTC 

A71V GTC to GTT 

V82F GTC to TTC 

V82A GTC to GCC 

V82T GTC to ACC 

V82S GTC to TCC 

I84V ATAtoGTA 

L90M TTGtoATG 

AG1343 

(nelfinavir) D30N GAT to AAT 

M36I 

M46I ATGtoATA 

L63P CTCtoCCC 

A71V GCT to GTT 
V771 

184V ATAtoGTA 
N88D 

L90M TTGtoATG 

BILA1906 V32I GTAtoATA 

BS M46I ATG to ATA 

M46L ATG to TTG 

A71V GCT to GTT 

I84A ATA to GCA 

184V ATAtoGTA 

BILA 201 1 V32I GTA to ATA 

(palinavir) A71V GCT to GTT 

I84A ATG to ATA 

L63P CTC to CCC 

BILA 2 1 85 BS L23I CTA to ATA 

BMS 1 86,318 A71T GCT to ACT 

V82A GTC to GCC 

DMP450 LI OF CTC to TTC 
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Table 2 - Cont'd-3 

M46I ATG to ATA 

D60E GAT to G AA 

I84V ATA to GTA 



KNI-272 V32I GTA to ATA 

MK-639 L10I CTCtoATC 

(L-735,524, LI OR CTCtoCGC 

indinavir) L10V CTCtoGTC 

K20M AAGtoATG 

K20R AAGto AAA 

L24I TTAtoATA 

V32I GTA to ATA 

M46I ATG to ATA 

M46L ATGtoTTG 

I54V ATCtoGTC 
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GAT 


ACA 


p30w39 






A 


GAT 


GAT 


ACA 


p30w40 






GCA 


GAT 


GAT 


ACA 



PCT/EP99/04317 



32 33 34 35 length Seq ID 
GTA TTA GAA GAA 



PTA 
bin 


mm 

X X 


1 Q 
10 


n 

1 


PTA 
bin 


mm 
X 1 


1 Q 


Q 

0 


PT7A 
bin 


mrnjv 


T Q 


Q 

y 


bl A 


mm 

' J. 1 


Z\J 


10 


PT7A 
bl A 


TT7V 


Zl 


11 






18 


12 


GT 




19 


13 


/■> m Tv 

GTA 


m/^ 

TG 


20 


14 


GTA 


TG 


19 


15 


GG 




19 


16 


GTA 


TT 


18 


17 


GTA 


TT 


19 


18 


GTA 


mm tv 

TTA 


19 


19 


0 m tv 

GTA 


TT 


20 


20 


GTA 


TTA 


21 


21 






18 


22 


GT 




19 


23 


GTA 


TG 


20 


24 


GTA 


TG 


19 


25 


GG 




19 


26 


GT 




15 


27 


GTA 


G 


16 


28 


GTA 




15 


29 


GTA 


G 


16 


30 


GT 




14 


31 


GG 




14 


32 


GT 




13 


33 






14 


34 






13 


35 


GTA 




15 


36 


GT 




14 


37 


GT 




14 


38 


GTA 


G 


14 


39 


ATA 


TT 


16 


40 


ATA 


TG 


16 


41 


GG 




13 


42 


GT 




14 


43 


ATA 


TT 


15 


44 


ATA 


TTA 


16 


45 


ATA 




15 


46 
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Table 3 - Cont'd-l 



44 45 46 47 48 49 50 51 52 53 54 length Seq ID 
CCA AAA ATG ATA GGG GGA ATT GGA GGT TTT ATC 



P48wl 






GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


GG 


18 


47 


P4 8w2 






GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


TG 


19 


48 


P48w3 






GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTG 


20 


49 


P48w4 






GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTT 


21 


50 


P48w5 




G 


GTA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTG 


21 


. 51 


P48w6 




ATG 


GTA 


GGG 


GGA 


ATT 


GGA 






18 


52 


P48.w7 




ATG 


GTA 


GGG 


GGA 


ATT. 


GGA 


G 




19 


53 


P48w8 


A 


ATG 


GTA 


GGG 


GGA 


ATT 


GGA 


• 




19 


54 


P48w9 


A 


ATG 


GTA 


GGG 


GGA 


ATT 


GGA 


G 




20 


55 


P48wl0 


A 


ATG 


GTA 


GGG 


GGA 


ATT 


GGA 


GGG 


GG 


22 


56 


P48w21 




ATA 


ATA 


GGG 


GGA 


ATT 


GGA 






18 


57 


P48w22 




ATG 


ATA 


GGG 


GGA 


ATT 


GGA 






18 


58 


P48w23 


A 


ATA 


ATA 


GGG 


GGA 


ATT 


GGA 






19 


59 


P48w24 


A 


ATG 


ATA 


GGG 


GGA 


ATT 


GGA 






19 


60 


P48w25 






ATA 


GGG 


GGA 


ATT 


GGA 


GGT 


GG 


18 


61 


P48w26 






ATA 


GGG 


GGA 


ATT 


GGA 


GGT 


TG 


19 


62 


,P48w28 






ATA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTG 


20 


63 


P48w29 






ATA 


GGG 


GGA 


ATT 


GGA 


GGT 


TTT 


21 


64 


P48mll 






GTA 


GTG 


GGA 


ATT 


GGA 


GGT 


GG 


18 


65 


P4 8ml2 






GTA 


GTG 


GGA 


ATT 


GGA 


GGT 


TG 


19 


66 


P4 8ml3 






GTA 


GTG 


GGA 


ATT 


GGA 


GGT 


TTG 


20 


67 


P4 8ml4 






GTA 


GTG 


GGA 


ATT 


GGA 


GGT 


TTT 


21 


68 


P4 8ml5 




G 


GTA 


GTG 


GGA 


ATT 


GGA 


GGT 


TTG 


21 


69 


P48ml6 




ATG 


GTA 


GTG 


GGA 


ATT 


GGA 






18 


70 


P4 8ml7 




ATG 


GTA 


GTG 


GGA 


ATT 


GGA 


G 




19 


71 


P4 8ml8 


A 


ATG 


GTA 


GTG 


GGA 


ATT 


GGA 






19 


72 


P4 8ml9 


A 


ATG 


GTA 


GTG 


GGA 


ATT 


GGA 


G 




20 


73 


P4 8m20 


A 


ATG 


GTA 


GTG 


GGA 


ATT 


GGA 


GGG 


GG 


22 


74 


P4 8m29 






ATA 


GTG 


GGA 


ATT 


GGA 


GGT 


GG 


18 


75 


P4 8m30 






ATA 


GTG 


GGA 


ATT 


GGA 


GGT 


TG 


19 


76 


P4 8m31 




ATG 


ATA 


GTG 


GGA 


ATT 


GGA 






18 


77 


P4 8m32 




ATG 


ATA 


GTG 


GGA 


ATT 


GGA 


G 




19 


78 


P4 8m33 


A 


ATG 


ATA 


GTG 


GGA 


ATT 


GGA 






19 


79 


p4 8w34 




G 


ATA 


GGG 


GGA 


ATT 


G 






14 


80 


p4 8w35 




TG 


ATA 


GGG 


GGA 


ATT 


G 






15 


81 


p4 8w36 




TG 


ATA 


GGG 


GGA 


ATT 


GG 






16 


82 


p48w37 




ATG 


ATA 


GGG 


GGA 


ATT 








15 


83 


p4 8m38 




G 


ATA 


GTG 


GGA 


ATT 


G 






14 


84 


p48m39 




TG 


ATA 


GTG 


GGA 


ATT 


G 






15 


85 


p48m40 




TG 


ATA 


GTG 


GGA 


ATT 


GG 






16 


86 


p48m41 




ATG 


ATA 


GTG 


GGA 


ATT 








15 


87 


p48w42 




ATA 


ATA 


GGG 


GGA 


ATT 








15 


88 


p48w43 




TG 


ATA 


GGG 


GGA 


GTT 








14 


89 


p48w44 




G 


ATA 


GGG 


GGA 


GTT 


G 






14 


90 


p48w45 


A 


ATG 


ATA 


GGA 


GGA 


ATT 








16 


91 


p4 8w4 6 




ATG 


ATA 


GGG 


GGA 


ATT 








15 


92 


p48w47 


AAA 


ATG 


ATA 


GGG 


GGA 










15 


93 


p48w48 


A AAA 


ATG 


ATA 


GGG 


GG 










15 


94 
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Table 3 - Cont'd-2 



p48w49 AA ATG ATA GGG GGA 

p48w50 AAA ATA ATA GGG GGA 

p48w51 AAA ATA AAA AT 

p48m52 AAA ATG ATA GTG GGA 

p48w52b AAA TTG ATA GGG GG 

p48m53 AAA ATG ATA GTG GGA 

p48w53b AAA TTG ATA GGG GGA 

p48w54 CA AAA TTG ATA G 

p48w55 ATG GTA GGG GGA 

p48w56 AA ATG GTA GGG GGA 

p48w57 A AAA ATG GTA GGG G 

p48w58 ATG ATA GGG GAA 

p48w59 ATA GGG GAA 

p48w60 ATA GGG GAA 

p48w61 ATG ATA GGG GGG 

P 48w62 ATA GGG GGG 

p48w63 A GGG GGG 

p48m64 AAA ATA ATA GTG GGA 

p48m65 A AAA ATA ATA GTG GGA 

p48m66 CA AAA ATA ATA GTG GG 

p48m67 AAA TTG ATA GTG GGA 

p48m68 A AAA TTG ATA GTG GGA 

. p48m69 CA AAA TTG ATA GTG G 

p48w70 AAA ATG ATA GGG GG 

p48w71 A AAA ATG ATA GGG G 

pc48w72 A AAA ATA ATA GGG GGA 



AG 




15 


95 


AG 




16 


96 






15 


97 


AG 




16 


98 






14 


99 






15 


100 






15 


101 






15 


102 


ATT 




15 


103 






14 


104 






14 


105 


ATT 




15 


106 


ATT 


GGA 


15 


107 


ATT 


GGA G 


16 


108 


ATT 




15 


109 


ATT 


GG 


14 


110 


ATT 


GGA 


13 


111 






15 


112 






16 


113 






16 


114 






15 


115 






16 


116 






15 


117 






14 


118 






14 


119 






16 


120 



WO 99/67428 



PCT/EP99/04317 



35 



Table 3 - Cont'd-3 





4 O 


4 D 


4 / 


A Q 
4 O 




D\J 


D 1 






^A 


length Seq ID 




7\ ft 

AAA 


Al G 


Gl A 




bun 


7ATT 


PP7A 
bun 


ob 1 


111 


2XTP 






PoOwl 








ppp 




&TT 


vjori 


bb 1 


TTT 
111 




1 Q 


i4i 


T"» C A ■ «. ft 

P50w2 






7\ 

A 


PPP 

GGG 


PP7A 
GGA 


7ATT 
Al 1 


PP7A 
bbA 


PPT 


TTT 
111 




1 Q 


i 00 


P50w3 






1 A 


GGG 


bo A 


Al i 


PP7A 


PPT 


rp mm 
111 






1 0*5 

1Z J 


P50w4 






Tv 

A 


GGG 


GGA 


7\ TT 
Ai 1 


PP74 

GGA 


PPT 

GG1 


TTT 


7\ P 
AG 


o a 
20 


IZh 


P50w5 






TA 


r* r* 
GGG 


GGA 


7\ mm 
AI 1 


PP7A 

GGA 


PPT 

GGT 


TTT 


TV 

AG 




IOC 


P50w6 






GTA 


GGG 


poll 
GGA 


ATT 


GGA 


GGT 


TGG 




19 


126 


P50w7 




G 


GTA 


GGG 


^ TV 

GGA 


7\ m m 

ATT 


TV 

GGA 


GGT 


TGG 




20 


127 


P50w8 






GTA 


GGG 


/""• TV 

GGA 


TV mm 

ATT 


/-«/-» TV 

GGA 


GGT 


mm/** 1 

TTG 




20 


128 


P50w9 






GTA 


GGG 


r* tv 

GGA 


7v mm 

ATT 


f~* f~* 7V 

GGA 


GGT 


TTT 




o ft 

20 


129 


PSOwlO 




TG 


GTA 


GGG 


GGA 


7v m m 

ATT 


TV 

GGA 


GGT 


GG 




20 


130 


p50w21 








GG 


/->» O 7» 

GGA 


TV mm 

ATT 


/^/^ TV 

GGA 


GGT 


TTT ■ 




17 


131 


P50w22 








GG 


TV 

GGA 


tv mm 

ATT 


GGA 


^^m 

GGT 


m m f~+ 

TTG 




16 


132 


P50w23 








GG 


GGA 


ATT 


GGA 


GGT 


TTT 


TV 

AG 


18 


133 


t>50w24 








GG 


GGA 


ATT 


GGA 


GGT 


TG 




15 


134 


P50w25 








G 


GGA 


ATT 


GGA 


GGT 


TTT 


AT 


18 


135 


P50w26 








GG 


GGA 


ATT 


GGA 


GGT 


TTT 




17 


136 


P50mll 








GGG 


GGA 


GTT 


GGA 


GGT 


TTT 




18 


137 


P50ml2 






A 


GGG 


GGA 


GTT 


GGA 


GGT 


TTT 




1.9 


138 


P50ml3 


- 




TA 


GGG 


GGA 


mm 

GTT 


GGA 


GGT 


TTT 




20 


n 0 a 

139 


P50ml4 






A 


GGG 


GGA 


GTT 


GGA 


GGT 


TTT 


TV /~* 

AG 


20 


1 A ft 

140 


P50ml5 






TA 


GGG 


GGA 


GTT 


GGA 


/*"» /-^ rri 

GGT 


TTT 


tv 

AG 


21 


141 


P50ml6 






GTA 


GGG 


GGA 


GTT 


GGA 


GGT 


TGG 




19 


142 


P50ml7 




G 


GTA 


GGG 


GGA 


GTT 


GGA 


GGT 


TGG 




z u 


14 J 


P50ml8 






GTA 


GGG 


GGA 


/"•mm 
Gi 1 


PP7A 

GGA 


PPT 

GG J. 


TTP 
1 1G 






n ii /i 
144 


P50ml9 






GTA 


GGG 


GGA 


/"•mm 
GTT 


GGA 


PPT 1 

GG1 


TTT 


7ATP 
Alb 


21 


1 yl C 

14o 


P50m20 




TG 


GTA 


GGG 


TV 

GGA 


/■••mm 

GTT 


GGA 


GGT 


GG 




20 


146 


P50m27 








GG 


GGA 


GTT 


GGA 


GGT 


TTG 




1 Q 


14 / 


t> r ft o O 

P50m2o 










PP7V 


pmm 
oil 


PP21 
bbn 


PPT 
bb 1 


TTT 
111 


rib 


18 


148 


P50m29 








GG 


GGA 


GTT 


GGA 


GGT 


TG 




15 


149 


P50m30 








G 


GGA 


GTT 


GGA 


GGT 


TTT 


AT 


18 


150 


p50w31 










GGA 


ATT 


GGA 


GGT 


TTT 




15 


151 


p50w32 








G 


GGA 


ATT 


GGA 


GGT 


TGG 




15 


152 


p50m33 










GGA 


GTT 


GGA 


GGT 


TTT 




15 


153 


p50m34 








G 


GGA 


GTT 


GGA 


GGT 


TGG 




14 


154 


p50m35 








GGG 


GGA 


GTT 


GGA 


G 






13 


155 


p50m36 








GG 


GGA 


GTT 


GGA 


G 






12 


156 


p50m37 








GGG 


GGA 


GTT 


GGA 








12 


157 


p50w38 










GGA 


ATT 


GGG 


GGT 


TTG 




14 


158 


p50W39 










GA 


ATT 


GGG 


GGT 


TTT 




14 


159 
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Table 3 - Cont'd-4 



p50w40 GA ATT GGG GGT TTT AG 15 160 

p 50w41 GGA ATT GGG GGT TG 13 161 

p 50w42 GGA ATT GGG GGT G 12 162 

p 50w 43 GA ATT GGG GGT TG 12 163 

p 50w44 GA ATT GGG GGT TTG 13 164 

p 50w4 5 GGG GGA ATT GCA G 13 165 

GGA ATT GCA GGT TG 14 166 

GGA ATT GCA GGT G 13 167 

GGA ATT GGA GGG TTG 14 168 

P 50w4 9 GA ATT GGA GGG TTG 13 169 

p 50w50 GA ATT GGA GGG TTT 14 170 

p 50w5l GGA ATT GGA GGC TTG 14 171 

p 50w52 . GA ATT GGA GGC TTG 13 172 

p 50w53 GA ATT GGA GGC TTT 14 173 

p50m54 GGA GTT GGA GGT TTG 15 174 

p50 m 55 GA GTT GGA GGT TTT 14 175 



p50w4 6 
p50w47 
p50w48 
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Table 3 - Cont'd-5 



51 52 53 54 55 





GGA 


GGT 


TTT 


ATC 


AAA 


p5 4 wl 




GGT 


TTT 


ATC 


AAA 


nS4 w2 




GT 


TTT 


ATC 


AAA 


S 4 w ^ 

LJ *J T W »J 




GT 


TTT 


ATC 


AAA 


n S 4 w4 
uj 




T 


TTT 


ATC 


AAA 


nS4 wS 

LV T W sJ 




GGT 


TTT 


ATC 


AAA 


«^ " vv \J 




GT 


TTT 


ATC 


AAA 


lj j / 




GGT 


TTT 


GCC 


AAA 


nR4mfi 

L/ »_/ *1 JUVJ 




GT 


TTT 


GCC 


AAA 


^j *j ■iiii^ 




GT 


TTT 


GCC 


AAA 


kJ w> 11L1X v 




T 


TTT 


GCC 


AAA 


W+J IIIIX X 




GGT 


TTT 


GCC 


AAA 


po ^IulZ 




pt 
o i 


TTT 
XXI 


GCC 


AAA 

nnn 


P JtWI J 




gt 

O X 


TTT 

XXX 


ATP 

n X W 


AAG 






ggt 


TTT 

XXX 


ATC 


AAG 


po*i Wl O 


n 


ggt 

UU X 


TTT 

XXX 


ATC 

n x w 


AAG 






O X 


TTT 

XXX 


ATC 

-ii J. W 


AAA 


r^dwl 7 
pj4wl / 






TTT 

XXX 


ATP 
n i >~» 


AAA 


pO <i W J. 0 


A 
n 


ppp 


TTT 

XXX 


ATP 

n X w> 


AAA 

nnn 


po h wx y 


A 
n 


ggp 

O 


TTT 

J. X X 


ATC 


AAA 


pOfJIIUc U 


A 


PPT 

OO X 


TTT 
XXX 


ATT 
n 1 x 


AAA 

nnn 


P 0 H IUZ X 




GGT 

OO X 


TTT 

XXX 


ATT 


AAA 


po4 wz^ 


pa 

on 


PPT 


TTT 
111 


ATT 
nl X 


AAA 
nnn 




Vj/i 


PPT 
oul 


111 


ATT 
nl 1 


AAA 
nnn 


pOftlRZ J 




PPT 
OO 1 


TTT 

XXX 


ATT 

■Ti. 1 X 


GGT 

WW X 






PPT 
oo 1 


TTP 


ATT 
nl X 


AAG 
nnu 


po f*mzo 




PPT 
oo 1 


TTP 


ATT 
ni x 


AAG 


pO*s WZ D 




PPT 

OOl 


TTP 


ATT 

n X X 


AAG 


po finu: d 


A 
n 


GGT 


TTP 
i i w 


ATT 

n X X 


AAG 


pOf4WZ / 




PPT 
oo x 


TTT 

XXX 


ATT 
ni 1 


AAG 


p JHUZ / 




ggt 


TTT 

XXX 


ATT 

ii X X 


AAG 


nS4m9 ft 


A 


ggt 


TTT 

i J. X 


ATT 

*l X X 


AAG 


pO filllZ Z7 


ga 

on 


PPT 

X 


TTT 

XXX 


ATT 

n x x 


AAG 


d54iti30 




GGT 


TTT 


ATT 


AAG 


p54w31 




GGT 


TTT 


ATC 


AAA 


p54w32 


A 


GGT 


TTT 


ATC 


AAA 


p54w33 


A 


GGT 


TTT 


ATC 


AAA 


p54w34 


GA 


GGT 


TTT 


ATC 


AAA 


p54m35 




GGT 


TTT 


GTC 


AAA 


p54m36 




GGT 


TTT 


GTC 


AAA 


p54m37 




GGT 


TTT 


GTC 


AGA 


p54m38 




GGT 


TTT 


GTC 


AGA 


p54w39 




GGG 


TTT 


ATC 


AAA 


p54w40 




GGG 


TTT 


ATC 


AAA 


p54w41 




GGC 


TTC 


ATC 


AAA 


p54w42 


GA 


GGC 


TTC 


ATC 


AAA 


p54m48 




GGT 


TTT 


GTC 


AAA 


p54m4 9 




GT 


TTT 


GTC 


AGA 



56 57 58 length Seq ID 
GTA AGA CAG 



GTA A 


16 


176 


GTA AG 


16 


177 


GTA AGA 


17 


178 


GTA AGA 


16 


179 


GTA 


15 


180 


GTA 


15 


181 


GTA 

W X ** 


15 


182 


GTA A 

W X ** 


15 


183 


GTA AG 

w x *i nvj 


16 

X V 


184 


GTA AGA 
oin nun 


16 

X \J 


185 

■L U J 


GT 
O X 


14 

X * 


186 

1 U v 


GTA 
o x n 


14 


187 

X U r 


GTA AA 
uin nn 


16 

X vj 


188 

X U VJ 


PTA A 
Oln n 


X o 


189 


GTA 
Oin 


16 

X w 


190 

X Z* VJ 


GTC APA 
Olu nun 


1 7 
x / 


191 

X 7 X 


GTC AGA C 
o x w nun 


16 

X V 


192 


CTA A 
oin n 


17 


193 


ftTA 
VwJ x n 


16 

X v 


194 


GTA A 
win n 


17 

X I 


195 

X *J 


PTA AP 
win nw 


17 

X ' 


196 

X «✓ w 


f^TA 
win 


17 

X 1 


197 

X -? / 


PTA 
uln 


17 

X f 


198 

X ^7 VJ 


TTT AT 
XXX nX 


1 6 

X VJ 


199 


PTA 
win 


X 


200 


PTA A 
Kd In n 


1 6 

X u 


201 

X. VJ X 


GTA 
win 


16 

X V 


202 


GTA 
oin 


16 

X \J 


203 


PTA A 
win n 


16 

X VJ 


204 

X U 1 


GTA A 
w x n n 


16 

X \J 


205 

X. VJ %J 


GTA 

win • 


16 

X w 


206 


GT 


1 6 

X vj 


207 

X> VJ / 


GTA AG 


17 


208 


GTA A 


16 


209 


GTA A 


17 


210 


GTA 


16 


211 


GT 


16 


212 


GTA 


15 


213 


GTA A 


16 


214 


GTA 


15 


215 


GTA A 


16 


216 


GTA 


15 


217 


GTA A 


16 


218 


GT 


14 


219 




14 


220 


GT 


14 


221 


GTA 


14 


222 
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Table 3 - Cont'd-6 



p54m50 GGT TTT GTC AGA 

p54w51 A- GGT TTA ATC AAA 

p54w52 GA GGT TTA ATC AAA 

p54m53 GGT TTT ACC AAA 

p54m54 GGT TTT ACC AAA 



GT 


14 


223 


GTA 


16 


224 


GT 


16 


225 


GTA 


15 


226 


GT 


14 


227 
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Table 3 - Cont'd-7 





78 


79- 


80 


81 


82 


83 


84 


85 


86 


87 


length Seq ID 




GGA 


CCT 


ACA 


CCT 


GTC AAC 


ATA 


ATT 


GGA 


AGA 






P82wl 




CCT 


ACA 


CCT 


GTC 


AAC 


ATA 


AG 






19 


228 


P82w2 




CCT 


ACA 


CCT 


GTC 


AAC 


ATA 


ATG 






20 


229 


P82w3 




CCT 


ACA 


CCT 


GTC 


AAC 


ATA 


ATT 






21 


230 


P82w4 


A 


CCT 


ACA 


CCT 


GTC 


AAC 


ATA 


AG 






20 


231 


P82w5 


A 


CCT 


ACA 


CCT 


GTC 


AAC 


ATA 


ATG 






21 


232 


P82w6 


A 


CCT 


ACA 


CCT 


GTC 


AAC 


ATA 








19 


233 


P82w7 


GA 


CCT 


ACA 


CCT 


GTC 


AAC 


ATA 








20 


234 


P82w8 






CA 


CCT 


GTC 


AAC 


ATA 


ATT 


GGA 




20 


235 


P82w9 






A 


CCT 


GTC 


AAC 


ATA 


ATT 


GGA 


A 


20 


236 


P82wl0 






ACA 


CCT 


GTC 


AAC 


ATA 


ATT 


GG 




20 


237 


P82W21 






A 


CCT 


GTC 


AAC 


ATA 


ATT 


GGA 




19 


238 


P82mll 




CCT 


ACA 


CCT 


ACC 


AAC 


ATA 


AG 






19 


239 


P82ml2 




CCT 


ACA 


CCT 


ACC 


AAC 


ATA 


ATG 






20 


240 


P82ml3 




CCT 


ACA 


CCT 


ACC 


AAC 


ATA 


ATT 






21 


241 

C T JL 


P82ml4 


A 


CCT 


ACA 


CCT 


ACC 


AAC 


ATA 


AG 






20 


242 


P82ml5 


A 


CCT 


ACA 


CCT 


ACC 


AAC 


ATA 


ATG 






21 


243 

C* 1 «J 


P 8 2ml 6 


A 


CCT 


ACA 


CCT 


ACC 


AAC 


ATA 








19 


244 


P82ml7 


GA 


CCT 


ACA 


CCT 


ACC 


AAC 


ATA 








20 


245 


P82ml8 






CA 


CCT 


ACC 


AAC 


ATA 


ATT 


GGA 




20 


246 


P82ml9 






A 


CCT 


ACC 


AAC 


ATA 


ATT 


GGA 


A 


20 


247 


P82m20 






ACA 


CCT 


ACC 


AAC 


ATA 


ATT 


G 




19 


248 

Cm "J \J 


P82m22 




CCT 


ACA 


CCT 


TTC 


AAC 


ATA 


ATT 






21 


249 


P82m23* 




CCT 


ACA 


CCT 


GCC 


AAC 


ATA 


ATT 






21 


250 


P82m24 




CCT 


ACA 


CCT 


TCC 


AAC 


ATA 


ATT 






21 


251 

Cm -J JL 


P82m25 






A 


CCT 


TTC 


AAC 


ATA 


ATT 


GGA 


A 


20 


252 

Cm +J Cm 


P82m26 






A 


CCT 


GCC 


AAC 


ATA 


ATT 


GGA 


A 


20 


253 

Cm *J — > 


P82m27 






A 


CCT 


TTC 


AAC 


ATA 


ATT 


GGA 


A 


20 


254 


P82m28 






A 


CCT 


ACC 


AAC 


ATA 


ATT 






16 


255 
*j *j 


P82m29 






A 


CCT 


TTC 


AAC 


ATA 


ATT 


GGA 




19 


256 

Cm+J \J 


P82m30 






A 


CCT 


GCC 


AAC 


ATA 


ATT 


GGA 




19 


257 

Cm %J f 


P82m31 






A 


CCT 


TCC 


AAC 


ATA 


ATT 


GGA 




19 


258 

£m +J O 


P82w32 




T 


ACA 


CCT 


GTC 


AAC 


AT 








15 


259 

Cm *J J7 


P82w33 




T 


ACA 


CCT 


GTC 


AAC 


ATA 








16 


260 


P82w34 






ACA 


CCT 


GTC 


AAC 


ATA 








15 


261 


P82w35 






CA 


CCT 


GTC 


AAC 


ATA 








14 


262 


P82m36 






ACA 


CCT 


ACC 


AAC 


ATA 








15 


263 


P82m37 






CA 


CCT 


ACC 


AAC 


ATA 








14 


264 


P82m38 






ACA 


CCT 


TTC 


AAC 


ATA 








15 


265 


P82m39 






CA 


CCT 


TTC 


AAC 


ATA 








14 


266 


P82m40 






ACA 


CCT 


GCC 


AAC 


ATA 








15 


267 


P82m41 






CA 


CCT 


GCC 


AAC 


ATA 








14 


268 


P82w42 






CA 


CCT 


GTC 


AAC 


GTA 








14 


269 


P82w43 






CA 


CCT 


GTC 


AAC 


GT 








13 


270 


P82w44 




CCT 


ACA 


CCT 


GTC 


AAC 










15 


271 


P82w45 




T 


ACG 


CCT 


GTC 


AAC 


AT 








15 


272 


P82w46 




CT 


ACG 


CCT 


GTC 


AAC 


AG 








15 


273 


P82m47 






ACA 


CCT 


TCC 


AAC 


ATA 








15 


274 
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Table 3 - Cont'd-8 



P82m48 






CA 


CCT 


TCC 


AAC 


ATA 






1 A 


01 R 


P82m49 






APA 


PPT 


TPP 


AAP 
Pxrrt*w 


AT 






1 A 


Z. f O 


PR 2m SO 






APA 

PVV-»rt 


PPT 


ATP 


AAP 


ATA 






1 ^ 

X 0 


Oil 
/.II 


P82mS1 






PA 


PPT 


ATP 


AAP 
rxri*w 


ATA 


AG 






01 Q 
Z / o 


IT O £.1U<J £. 






PA 


PPT 


ATP 


AAP 


ATA 


ATG 




1 D 


01 d 

z / y 


t 0 ZUl«J o 






A 
r\ 


PPT 


ATP 


AAP 


ATA 


ATG 




1 c; 
ID 


zoU 


irO Z W 3** 








CC1 


•olC 


A Ap 


ATA 


ATT 




Id 


001 
zol 


it 0 z w o o 








CC Jl 


Gi i 


a. 

AAC 


ATA 


ATT 


p 

G 


16 


o o o 

zoz 


ro^WO 0 






A 
A 


p*p»m 

CCT 


Gl I 


AAP 

AAC 


ATA 


ATG 




15 


O O D 

283 


DO Oi.iRT 








ppp 

CCG 


prpp 

GJ.C 


AAP 

AAC 


ATA ATT 




15 


284 


ro^WOo 






7\ pp 
ACG 


ppm 

CCI 


pmp 

G1C 


A A r"» 
AAC 


AT 






14 


285 










pi pi m 

CCT 


prpp 

GTC 


TV TV m 

AAT 


ATA 


ATT 




15 


Art r 

286 


ro^WOU 






CA 


pi pi rp 

CCT 


prpp 

GTC 


a a m 

AAT 


ATA 


ATG 




16 


287 


rozwol 






7\p7A 

ACA 


ppm 

CCT 


prpp 

G1C 


A A rp 

AAT 


ATA 


AG 




16 


r* 0 rt 
288 


DQOm£0 








ppm 

CCT 


ppp 
GCC 


A A m 

AAT 


ATA 


ATT 




15 


O ft ft 

289 








r* a 
CA 


ppm 

CCT 


ppp 

GCC 


a a m 
AAT 


ATA 


AG 




15 


290 


DQ Ow C ^ 








P* prn 

CCT 


7\ p p 

ACC 


TV TV P 

AAC 


GTA 


ATT 




15 


291 


no Om £ c 
ro zlUba 








/"i /*i rp 

CCT 


TV PP 

ACC 


TV TV P 

AAC 


GTA 


ATG 




14 


292 


tr ozrribb 






CA 


CCT 


7\ p p 

ACC 


TV TV P 

AAC 


GTA 






14 


293 


Pozmb / 






7\ P TV 

ACA 


r» /-i rn 

CCT 


»/iri 

ACC 


TV TV 

AAC 


GT 






14 


294 


Pozmbo 








CCT 


mm /~» 

TTC 


TV TV 

AAC 


GTA 


ATT 




15 


295 


Pozmby 






CA 


o m 

CCT 


TTC 


AAC 


GTA 


AG 




15 


296 


D O Owl A 

Pozitl / U 






TV p TV 

ACA 


CCT 


mmp 

TTC 


TV TV 

AAC 


GTA 






15 


297 


DQ OmT 1 

r oZITl / 1 






A 


ppm 

CCT 


mmp 

TTC 


TV TV P* 

AAC 


GTA 


ATG 




15 


298 


pozw / Z 








Pm 

CT 


pmp 

GTC 


TV TV m 

AAT 


ATA 


ATT 


G 


15 


299 


pozw / J 








ppm 

CCT 


pmp 

GTC 


TV TV m 

AAT 


ATA 


ATT 


G 


16 


300 


pozW / 4 






A 


pi pirn 

CCT 


pmp 

GTC 


TV TV m 

AAT 


ATA 


ATT 




16 


301 


pozw / 0 








p» m 

CT 


pmp 

GTC 


a a m 

AAT 


ATA 


ATT 


GG 


16 


302 


pozw / b 




CCT 


7\ pp 

ACG 


ppm 

CCT 


pmp 

GTC 


TV TV 

AA 








14 


303 


pozw / / 




p rn 

CT 


7\ pip* 

ACG 


ppm 

CCT 


pmp 

GTC 


TV TV P 

AAC 








14 


304 


pozW / o 


A 


CCT 


TV P'P* 

ACG 


prim 

CCT 


pmp 

GTC 


TV TV 

AA 








15 


305 


pozw / y 


A 


p P m 

CCT 


7\ P'P* 

ACG 


P'P* rn 

CCT 


pmp 

GTC 


A 








14 


306 


«. o ft , , o ft 

pozWoU 




m 

T 


A P* A 

ACA 


ppp 

CCG 


pmp 

GTC 


TV TV P 

AAC 


A 






14 


307 


pozWo 1 




p m 

CT 


7\ P TV 

ACA 


ppp 

CCG 


pmp 

GTC 


TV TV 

AA 








13 ' 


308 


pozWoz 




prim 

CCT 


7\ pi 7\ 

ACA 


ppp 

CCG 


pmp 

GTC 


A 








13 


309 


poZwo J 






P 7\ 

CA 


p«p* rn 

CCT 


pmp 

GTC 


TV TV r* 

AAC 


ATA 


A 




15 


310 


pozwb4 






A 


CCT 


pmp 

GTC 


TV TV r* 

AAC 


ATA 


AT 




15 


311 


pozWoO 




CT 


7\ P* A 

ACA 


CCT 


pmp 

GTC 


TV TV r> 

AAC 


A 






15 


312 


poZWoo 






TV P TV 

ACA 


p* prn 
CCI 


pmp 
GIC 


AAP* 

AAC 


AT 






1 A 

14 


313 


r\R 9uiP. *7 
pOiWO / 






A 


PPT 




AAC 


ATA 


ATT 


p 
G 


1 / 


314 


p82w88 






CA 


CCT 


GTT 


AAC 


ATA 


AG 




15 


315 


p82w89 






ACA 


CCT 


GTT 


AAC 


ATA 


AG 




16 


316 


p82w90 






TCA 


CCT 


GTC 


AAC 


ATA 






14 


317 


p82w91 






ACA 


CCT 


GTC 


AAC 


ATA 


A 




16 


318 


p82w92 






CA 


CCT 


GTC 


AAC 


ATA 


AT 




16 


319 


p82w93 








CCT 


GTC 


AAC 


ATA 


ATT 




15 


320 


p82w94 






A 


CCT 


GTC 


AAC 


ATA 


ATT 




16 


321 


p82w95 








CCT 


GTC 


AAC 


ATA 


ATT 


G 


16 


322 


P82w96 




CCT 


ACA 


CCT 


GTC 


AA 








14 


323 


p82w97 








T 


GTC 


AAC 


ATA 


ATT 


GG 


15 


324 


p82w98 








T 


GTC 


AAC 


ATA 


ATT 


GGA 


16 


325 
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Table 



p82m99 




ACA 


CCT 


TTC 


p82ml00 


T 


ACA 


CCT 


TTC 


p82ml01 




ACA 


CCT 


ATC 


P82ml02 




ACA 


CCT 


ATC 


p82ml03 




CA 


CCT 


GCC 


p82ml04 




ACA 


CCT 


GCC 


p82ml05 




ACG 


CCC 


TTC 


p82ml06 




CG 


CCC 


TTC 


p82ml07 


T 


ACG 


CCC 


TTC 


p82wl08 


CT 


ACA 


CCG 


GTC 


p82wl09 


CCT 


ACA 


CCG 


GTC 


p82wll0 




A 


CCG 


GTC 


p82wlll 




A 


CCG 


GTC 


p82wll2 


CT 


ACA 


CCA 


GTC 


p82wll3 


CT 


ACA 


CCA 


GTC 


p82wll4 




ACA 


CCA 


GTC 


p82wll5 




ACA 


CCA 


GTC 


p82wll6 


T 


ACG 


CCT 


GTC 


p82wll7 




ACG 


CCT 


GTC 


p82wll8 


T 


ACG 


CCT 


GTC 


p82mll9 


CCT 


ACA 


CCT 


TTC 


p82ml20 


CT 


ACA 


CCT 


TTC 


p82ral21 


A CCT 


ACA 


CCT 


TTC 


p82wl22 




ACG 


CCT 


GTC 


p82wl23 


T 


ACG 


CCT 


GTC 


p82wl24 




CG 


CCT 


GTC 


p82ml25 


T 


ACA 


CCT 


TTC 


p82ml26 




ACA 


CCT 


TTC 


p82ml27 




CA 


CCT 


TTC 


p82ml28 




A 


CCT 


TTC 


p82ol29 








C 


p82ol30 








C 
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3 - Confd-9 



AAC 


ATA 


A 






16 


326 


AAC 


ATA 








16 


327 


AAC 


ATA 


ATG 






17 


328 


AAC 


ATA 


AG 






16 


329 


AAT 


ATA 


ATG 






16 


330 


AAT 


ATA 


AG 






16 


331 


AAC 


ATA 








15 


332 


AAC 


ATA 


AG 






15 


333 


AAC 


AT 








15 


334 


AAC 










14 


335 


AA 










14 


336 


AAC 


ATA 


ATG 






15 


337 


AAC 


ATA 


ATT 






16 


338 


AAC 










14 


339 


AAC 


A 








15 


340 


AAC 


ATA 








15 


341 


AAC 


ATA 


AG 






16 


342 


AAC 


AT 








15 


343 


AAC 


ATA 








15 


344 


AAC 


A 








14 


345 


AAC 










15 


346 


AAC 










14 


347 


AA 










15 


348 


AAC 


ATA 


AGG 






16 


349 


AAC 


ATA 








16 


350 


AAC 


ATA 


AGG 






15 


351 


AAC 


GTA 








16 


352 


AAC 


GTA 


AGG 






16 


353 


AAC 


GTA 


ATG 






16 


354 


AAC 


GTA 


ATT 






16 


355 


AAC 


GTA 


ATT 


GGA 


AGA 


16 


356 


AAC 


GTA 


ATT 


GGA 


AG 


15 


357 
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Table 3 - Cont'd-10 



86 87 88 89 90 91 92 93 94 length Seq ID 
GGA AG A' AAT CTG TTG ACT CAG ATT GGT 



P90wl 




A 


AAT 


CTG 


TTG ACT 


CAG 






16 


358 


P90w2 




GA 


AAT 


CTG 


TTG ACT 


CAG 






17 


359 


P90w3 




GA 


AAT 


CTG 


TTG ACT 


CAG 


AGG 




18 


360 


P90w4 




A 


AAT 


CTG 


TTG ACT 


CAG 


AGG 




17 


361 


P90w5 




AGA 


AAT 


CTG 


TTG' ACT 


CAG 


AGG 




19 


362 


P90w6 




AGA 


AAT 


CTG 


TTG ACT 


CAG 


ATG 




20 


363 


P90w7 




AGA 


AAT 


CTG 


TTG ACT 


CAG . 


ATT 




21 


364 


P90w8 


AGA AAT CTG TTG, ACT CAG ATTGG 




20 


365 


P90w9 


GA 


AGA 


AAT 


CTG 


TTG ACT 


CAG 


AGG 




21 


366 


P90wl0 


A 


AGA 


AAT 


CTG 


TTG ACT 


CAG 


ATG 




21 


367 


P90mll 




AGA 


AAT 


CTG 


ATG ACT 


CAG 


ATG 




20 


368 


P90ml2 




AGA 


AAT 


CTG 


ATG ACT 


CAG 


ATT 




21 


369 


P90ml3 


A 


AGA 


AAT 


CTG 


ATG ACT 


CAG 


AGG 




20 


370 


P90ml4 ' 


GA 


AGA 


AAT 


CTG 


ATG ACT 


CAG 


AGG 




21 


371 


P90ml5 


A 


AGA 


AAT 


CTG 


ATG ACT 


CAG 


ATG 




21 


372 


P90ml6 


GA 


AGA 


AAT 


CTG 


ATG ACT 


CAG 


ATT 




20 


373 


P90ml7 . 


GGA 


AGA 


AAT 


CTG 


ATG ACT 


CAG 






21 


374 


P90ml8 


A 


AGA 


AAT 


CTG 


ATG ACT 


CAG 






19 


375 


P90ml9 




A 


AAT 


CTG 


ATG ACT 


CAG 


ATT 


GG 


21 


376 


P90m20 




A 


AAT 


CTG 


ATG ACT 


CAG 


ATT 


G 


20 


377 


P90m21 




A 


AAT 


CTG 


ATG ACT 


CAG 


CTT 


G 


20 


378 


P90m22 




A 


AAT 


CTG 


ATG ACT 


CAG 


CTT 




19 


379 


P90m23 






AAT 


CTG 


ATG ACT 


CAG 


CTT 


G 


18 


380 


P90w24 




A 


AAT 


CTG 


TTG ACT 


CAG 


CTT 


G 


20 


381 


P90w25 




A 


AAT 


CTG 


TTG ACT 


CAG 


CTT 




19 


382 


P90w26 






AAT 


CTG 


TTG ACT 


CAG 


CTT 


G 


19 


383 


P90w27 






AAT 


CTG 


TTG ACT 


CA 






14 


384 


P90w28 






AAT 


CTG 


TTG ACT 


CAG 






15 


385 


P90w29 




A 


AAT 


CTG 


TTG ACT 


CA 






15 


386 


P90w30 




A 


AAT 


CTG 


TTG ACT 


CAG 






16 


387 


P90m31 






AAT 


CTG 


ATG ACT 


CA 






14 


388 


P90m32 






AAT 


CTG 


ATG ACT 


CAG 






15 


389 


P90m33 




A 


AAT 


CTG 


ATG ACT 


CA 






15 


390 


P90m34 




A 


AAT 


CTG 


ATG ACT 


CAG 






16 


391 


P90w35 




GA 


AAT 


CTG 


TTG ACT 


C 






15 


392 


P90w36 




GA 


ACT 


CTG 


TTG ACT 


C 






15 


393 


P90w37 






T 


CTG 


TTG ACT 


CAG 


ATG 




15 


394 


P90w38 




GA 


AAT 


CTG 


TTG ACT 


C 






15 


395 


P90w39 




GA 


ACT 


CTG 


TTG ACT 


C 






15 


396 


P90w40 




A 


AAT 


CTG 


TTG ACT 


CA 






15 


397 


P90w41 






AAT 


CTG 


TTG ACT 


CAG 






15 


398 


P90m42 






AAT 


CTG 


ATG ACT 


CAG 






15 


399 


P90m43 




A 


AAT 


CTG 


ATG ACT 


CA 






15 


400 


P90w44 






AT 


CTG 


TTG ACT 


CAG 


AG 




15 


401 


P90w45 








CTG 


TTG ACT 


CAG 


ATT 




15 


402 


P90w46 




AGA 


AAT 


CTG 


TTG ACT 








15 


403 


P90m47 






AT 


CTG 


ATG ACT 


CAG 


AG 




15 


404 
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TabIe3-Cont'd-ll 



P90m48 
P90m49 
P90w50 
P90w51 
P90w52 
P90w53 
P90w54 
P90w55 
P90m56 
P90m57 
P90w58 
P90w59 
P90w60 
P90w61 
P90m62 
P90m63 
P90w64 
P90w65 
P90w66 
P90w67 
,P90w68 
. P90w69 
p90w70 
p90w71 
p90w72 
p90w73 
p90w74 
p90w75 
p90w76 
p90w77 
p90w78 
p90w79 
p90w80 
p90w81 
p90ra82 
p90m83 
p90m84 
p90m85 
p90m8 6 
p90m87 
P90w88 
P90w89 
p90w90 
p90wl00 
p90m92 
p90m93 
p90ra94 
p90m95 
p90m96 



CTG ATG ACT CAG 


ATT 




15 


405 


APA AAT CTG ATG ACT CA 

rturt xxx* X w mvj xxw x w*» 






17 


406 


AAT ATG TTG ACT CAG 

ni* x n x w x x w xxw x wxxw 






15 


407 


PA AAT ATG TTG ACT CA 

wn xxx* x n x w a i\j xxw x 






16 


408 


AAT TTG TTG ACT CAG 

nni x x \j x x w xxw x wx *w 






15 


409 


GA AAT TTG TTG ACT CA 

\jj n iin x xx vj x x >«j * *w w* a 






16 


410 


AAT ATG TTG ACC CAG 

xxxx X xl X VJ X X W X^W W WX *w 






15 


411 


A AAT ATG TTG ACC CA 

t\ /ui x xi x w x x w w wx * 






15 


412 


AAT ATG ATG ACC CAG 
rxxx X x* x w n x w iiv.'V v»*v 






15 


413 


iv rac ATP ATP APP PA 
/x bnb nio nio ttbb wn 






15 


414 


AAP HTC TTP APT PAfi 
/inL nib lib rib 1 brlb 






15 


415 


7\ AAP ATP TTP APT PAH 
A /x/\b nib lib /xbl b/lb 






15 

X %J 


416 


TP rnmr APT PAft 


PTT 

w X X 




14 


417 


PTH TTf^ APT PAft 
Liu lib n\y X V^nu 


PTG 




14 

X x 


418 


pt 7at^ apt par; 


PTT 

b X X 




14 

X. "X 


419 


prnr ATP APT PAP 
Lib nl u rib 1 brib 


p_p 

b \3 




14 

x ~ 


420 


fpr T\pf APA PAP 
1 b rib 1 n^n L-nu 


PTT 
b 1 1 




14 

X x 


421 


pfpr TTP APA PAP 
bio lib ribri. brib 


p_p 

b b 




14 

x, "a 


422 


7\ TV rn /-""TP TTC APA PAP 
AA 1 Lib. lib rVbrl br\b 






15 

X «J 


423 


7\ 7\ C* r*TP TTP APT PA 
Arxb bib lib rib 1 bri 






13 

X -J 
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7V 7\7\/*" pfpp TTP APT P 

A AAb bib lib AL 1 b 






13 

X «J 


425 
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GA AAb Lib lib /lb 1 
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X «J 
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P TTP APT PAP 
b lib Ab 1 brib 
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r\ 1 1 


PPP 
bbb 


15 


429 


TP TTP APA PAP 
lb lib ribri brib 


PTT 
b 1 1 


P 
b 


15 


430 


PTP TTP APA PAP 
bib lib r»bri brib 


PTT 

w X X 




15 

X \J 


431 


P TTG APA PAG 


PTT 

W X X 


GGG 

WWW 


15 


432 


TP TTP APT PAG 
1 w lib rxw X v-nvj 


PTT 

W X X 


G 

w 


15 


433 


P TTft APT PAG 
\j i x b no x ^nu 


ATG 

n x w 




15 


434 


P TTP APT PAft 
kj lib riL x ^rtvj 


CTT 

V— ' X X 


p 

w 


14 


435 


TP TTP APP PAft 
lb lib nbb wriw 


ATT 

xx X X 


p 

w 


15 


436 


P TTP APP PAP 
b lib nbb wr\w 


ATT 
n i x 


p 

w 


14 


437 


P TTP APP PAP 
b lib ribb b/xb 


ATT 


www 


15 


438 


TP ATP APT PACi 
lb nib nL. x Lr\w 


ATT 
n x x 


G 

w 


15 


439 


TG ATG ACT CAG 

X n— j xx x vj nv x V— ^ a 


ATT 


GGG 


16 


440 


G ATG ACT CAG 


ATT 


GGG 


15 


441 


G ATG ACT CAG 


ATT 


GGT 


16 


442 


CTG ATG ACT CAG 


CTT 




15 


443 


TG ATG ACT CAG 


CTT 


G 


15 


444 


A AAT CTG TTG ACT CA 






15 


445 


A AAT CTG TTG ACT CA 






15 


446 


A AAT CTG TTG ACT CA 






15 


447 


AAT CTG ATG ACT CAG 






15 


448 


A AAT CTG ATG ACT CA 






16 


449 


GA AAT CTG ATG ACT C 






15 


450 


CTG ATG ACT CAG 


ATG 




15 


451 


AGA AAT ATG ATG 






15 


452 


A AGA AAT ATG ATG ACT 






16 


453 
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Table3-Cont'd-12 



p90m97 A AGA AAT CTG ATG 

p90m98 A AGA AAT ATA ATG 

p90m99 A AAT ATA ATG 

p 9 0ml 00 AAT ATG ATG 

p 9 0ml 01 AAC CTG ATG 

p 9 0ml 02 AGA AAT TTG ATG 

p90ml03 A AAT TTG ATG 

p90ml04 AC CTG ATG 

P 90ml05 AAT CTG ATG 

p90ml06 AT CTG ATG 

P 90ml07 AT CTG ATG 

P 90ml08 CTG ATG 

p 9 0ml 09 AGA AAT CTG ATG 

p 9 0ml 10 AGA AAT CTG ATG 

p 9 0ml 11 GA AGA AAT CTG ATG 

p 9 0ml 12 GGA AGA AAT CTG ATG 

p 9 0ml 13 GA AGA AAT CTG ATG 

p 9 0ml 14 AGA AAT CTG ATG 

p90wll5 AAT CTG TTA 

p90wll6 T CTG TTA 

p90wll7 AT CTG TTA 

p90wll8 AGA AAT TTG TTG 

P 90wll9 GA AAT TTG TTG 

p90wl20 AAT TTG TTG 



7\pT 






16 


454 








J. D 


455 

•1 <U 


"ROT 1 


pup 




1 6 


456 
*i *j \j 




PZiP 
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1 5 
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AC I 
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1 6 
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"S *J J 








1 6 
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■± \J V/ 


ACT 


CAG 




1 A 


4 61 


ACT 


CAG 


A 


1 £ 
1 D 


ft 


ACT 


CAG 


ATG 


ID 


4 6^ 
fl DO 


ACT 


CACj 




1 A 


4 64 


ACT 


CAG 


ATT G 


16 


465 


ACT 


C 




16 


466 


ACT 






15 


4 67 


A 






15 


468 


A 






16 


469 


AC 






16 


470 


AC 






14 


471 


ACT 


CAG 




15 


472 


ACT 


CAG 


ATT 


16 


473 


ACT 


CAG 


AG 


15 


474 


ACT 






16 


475 


ACT 


C 




15 


476 


ACT 


CAG 




15 


477 
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Table 6 



p30 


USA 


France 


U.K. 


Brazil 


Spain 


Luxemb . 


Belgium 


w25 


98.9 


99.4 


88.9 


98.3 


94.3 


100.0 


97.0 


w29 


2.5 


0.6 


0.0 


1.7 


0.0 


0.0 


0.0 


w32 


3.3 


0.6 


5.6 


5.2 


5.7 


6.7 


1.5 


w36 


2.5 


0.0 


0.0 


3.4 


0.0 


0 . 0 


1 . 0 


m?3 


3.1 


0.0 


0.0 


0.0 


0.0 


0.0 


1.0 




0. 6 


0.6 


5.6 


0.0 


0.0 


0.0 


1.0 


r>4 6/48 


USA 


France 


U.K. 


Brazil 


Spain 


Luxemb . 


Belgium 


w47 


94 . 2 


80. 5 


83.3 


89.7 


97.1 


73.3 


82.9 


w45 


8.6 


15.6 
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1.7 


5.7 


6.7 


11.1 


w72 


4.2 


0.0 


16.7 


0.0 


2.9 


13.3 
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0.0 


10.3 
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Belgium 
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91.6 


93.5 


94.4 


77.6 
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5.5 
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0.5 
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3.0 
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Tabie 6 - Cont'd 
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0 . 0 
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O . / 


u . u 
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0 . 0 


u . u 
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0.0 
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U • u 
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m63 


0.3 


0.0 


0.0 


1.7 


2.9 


13. 3 
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0.0 


0.0 


0.0 


2.9 
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23.8 
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13.6 


11.1 


21.9 




w52 


8.4 


11.2 


5.7 


6.8 


13.9 


4.8 




w69 


5.2 


1.4 


5.7 


2.3 


0.0 


3.8 




w73 


6.1 


9.1 
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0.0 


8.3 


6.7 




w79 
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11.2 


8.6 
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Table 7 

Tm lengte Seq ID 



pc50w5 


AGG 


GGG AAT 


TGG 


AGG 


TTT 


TA 










20 


511 




26 


27 


28 


29 


30 


31 


32 


33 


34 


35 










ACA 


GGA 


GCA 


GAT 


GAT 


ACA 


GTA 


TTA 


GAA 


GAA 








pc30w25 






GCA 


GAT 


GAT 


ACA 


GT' 








40 


14 


31 


pc30w29 




A 


GCG 


GAT 


GAT 


ACA 










36 


13 


. 35 


pc30w32 






GCA 


GAT 


GAC 


ACA 


GT 








42 


14 


38 


pc30w36 






GCA 


GAC 


GAT 


ACA 


GG 








40 


14 


42 


pc30m23 




A 


GCA 


GAT AAT 


ACA 


GT 








40 


15 


29 




44 


45 


46 


47 


48 


49 


50 


51 


52 












CCA 


AAA 


ATG 


ATA 


GGG 


GGA 


ATT 


GGA 


GGT 










pc48w37 






ATG 


ATA 


GGG 


GGA 


ATT 










15 


512 


pc48w47 




AAA 


ATG 


ATA 


GGG 


GGA 










42 


15 


93 


pc48w73 


A 


AGA 


ATG 


ATA 


GGG 


G 












14 


513 


pc48w45 




AAA 


ATG 


ATA 


GGA 


GGA 


ATT 








42 


18 


91 


pc48w72 


A 


AAA 


ATA 


ATA 


GGG 


GGA 










42 


16 


120 


pc48m41 






ATG 


ATA 


GTG 


GGA 


ATT 








40 


15 


87 





48 


49 


50 


51 


52 


53 


54 










GGG 


GGA 


ATT 


GGA 


GGT 


TTT 


ATC 








pc50w31 




GGA 


ATT 


GGA 


GGT 


TTT 




42 


15 


151 


pc50w44 




GGA 


ATT 


GGG 


GGT 


TT 




42 


14 


164 


pc50w52 




GA 


ATT 


GGA 


GGC 


TTG 






14 


172 


pc50m37 


GGG 


GGA 


GTT 


GGA 








40 


12 


157 




51 


52 


53 


54 


55 


56 


57 58 










GGA 


GGT 


TTT 


ATC 


AAA 


GTA 


AGA CAG 








pc54w34 


GA 


GGT 


TTT 


ATC 


AAA 


GT 




42 


16 


212 


pc54wl4 




GGT 


TTT 


ATC 


AAG 


GTA 


A 


42 


16 


189 


pc54wl9 


A 


GGC 


TTT 


ATC 


AAA 


GTA 




42 


16 


194 


pc54w22 


GA 


GGT 


TTT 


ATT 


AAA 


GTA 




42 


17 


197 


pc54w26 


A 


GGT 


TTC 


ATT 


AAG 


GTA 




42 


16 


202 


pc54w27 




GGT 


TTT 


ATT 


AAG 


GTA 


A 


40 


16 


204 


pc54m35 




GGT 


TTT 


GTC 


AAA 


GTA 




40 


15 


213 


pc54m37 




GGT 


TTT 


GTC 


AGA 


GTA 




42 


15 


215 


pc54m55 


A 


GGT 


TTT 


GCC 


AAA 


GT 






15 


516 





78 


79 


80 


81 


82 


83 


84 


85 


86 


87 










GGA 


CCT 


ACA 


CCT 


GTC 


AAC 


ATA 


ATT 


GGA 


AGA 








pc82w91 






ACA 


CCT 


GTC 


AAC 


ATA 


A 






44 


16 


318 


pc82w60 






CA 


CCT 


GTC 


AAT 


ATA 


ATG 






42 


17 


287 


pc82wlll 






A 


CCG 


GTC 


AAC 


ATA 


ATT 






44 


16 


338 


pc82w89 






ACA 


CCT 


GTT 


AAC 


ATA 


AG 






42 


17 


316 


pc82ml01 






ACA 


CCT 


ATC 


AAC 


ATA 


AT 








17 


517 


pc82w42 






CA 


CCT 


GTC 


AAC 


GTA 








42 


14 


269 


pc82m38 






ACA 


CCT 


TTC 


AAC 


ATA 








40 


15 


265 


pc82ml05 






ACG 


CCC 


TTC 


AAC 


ATA 








44 


15 


332 


pc82ml27 






CA 


CCT 


TTC 


AAC 


GTA 


ATG 






44 


17 


354 



WO 99/67428 



PCT/EP99/04317 



50 



Table 7- Cont'd 



pc82m40 
pc82m63 
pc82m36 
pc82m67 



86 



pc90w27 

pc90w37 

pc90w39 

pc90w50 

pc90w52 

pc90w69 

pc90w73 

pc90w79 

pc90ml38 

pc90m56 





ACA CCT GCC AAC ATA 




44 


15 

X *J 


267 




CA CCT GCC AAT ATA 


. AG 




42 


16 


290 

£m -J \J 




ACA CCT ACC AAC ATA 








518 

*J X U 




ACA CCT ACC AAC GT 








14 


519 


87 


88 89 90 91 92 


93 


94 








AGA 


AAT CTG TTG ACT CAG 


ATT 


GGT 










AAT CTG TTG ACT CA 






38 


14 


384 




T CTG TTG ACT CAG 


AT 






15 


514 


GA 


GTC AAC AGA GTT C 








15 


515 




AAT ATG TTG ACT CAG 






40 


15 


407 




AAT TTG TTG ACT CAG 






40 


15 


409 


GA 


AAC CTG TTG ACT 






40 


14 


426 




TG TTG ACA CAG 


CTT 


G 


44 


15 


430 




TG TTG ACC CAG 


ATT 


G 


44 


15 


436 


GTC ATC AGA TTT CT 








14 


510 




AAT ATG ATG ACC CAG 






42 


15 


413 
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CLAIMS 

Method for determining the susceptibility to antiviral drugs of HIV viruses in a biological 
sample, with said method comprising: 

a) if need be, releasing, isolating or concentrating the polynucleic acids present in the 
sample; 

b) if need be amplifying the relevant part of the protease gene of HIV with at least one 
suitable primer pair; 

c) hybridizing the polynucleic acids of step a) or b) with at least one of the following 
probes: 

probes specifically hybridizing to a target sequence comprising codon 30; 

probes specifically hybridizing to a target sequence comprising codon 46 and/or 48; 

probes specifically hybridizing to a target sequence comprising codon 50; 

probes specifically hybridizing to a target sequence comprising codon 54; 

probes specifically hybridizing to a target sequence comprising codon 82 and/or 84; 

probes specifically hybridizing to a target sequence comprising codon 90; 

or the complement of said probes: 

further characterized in that said probes specifically hybridize to any of the target sequences 
presented in figure 1, or to the complement of said target sequences; 

d) inferring from the result of step c) whether or not a mutation giving rise to drug resistance 
is present in any of said target sequences. 

Method according to claim 1, further characterized in that said polynucleic acids of step a) or 
b) hybridize with at least two of the said probes, or to the complement of said probes . 

Method according to claim. 2, further characterized in that said probes are chosen from the 
following list: seq id no 7 to seq id no 477, seq id no 510 to seq id no 519 or the complement of 
said probes. 

Method according to any of claims 1 to 3, further characterized in that said primer pair is chosen 
from the following primers: seq id no 3, seq id no 503, seq id no 504, seq id no 4, seq id no 506, 
seq id no 507, seq id no 508 and seq id no 509. 

Method according to any of claims 1 to 3, further characterized in that: 
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step b) comprises amplifying a fragment of the protease gene with at least one 5'-primer 
specifically hybridizing to a target sequence located at nucleotide position 210 to 260 of the 
protease gene, in combination with at least one suitable S'-primer, and 
step c) comprises hybridizing the polynucleic acids of step a) or b) with at least one of the probes 
specifically hybridizing to a target sequence or its complement, comprising codon 90. 

6. Method according to any of claims 1 to 3, further characterized in that: 

step b) comprises amplifying a fragment of the protease gene with at least one 3-primer 
specifically hybridizing to a target sequence located at nucleotide position 253 fcodon 85) to 
position 300 . in combination with at least one suitable S'-primer, and 

step c) comprises hybridizing the polynucleic acids of step a) or b) with at least one of the probes 
specifically hybridizing to a target sequence or its complement comprising any of codons 30, 
46, 48, 50, 52, 54, 82 and 84. 

7. Method according to claim 5, further characterized in that the 5'-primer is seq id 5 and the S'- 
primer is one primer or a combination of primers chosen from the following primers: seq id no 
4, seq id no 506, seq id no 507, seq id no 508 and seq id no 509. 

8. Method according to claim 6, further characterized in that the 5'-primer is one primer or a 
combination of primers chosen form the following primers: seq id no 3, seq id no 503, seq id no 
504 and the 3'-primer is seq id no 6. 

9. A probe as defined in any of claims 1 to 3, for use in a method for determining the susceptibility 
to antiviral drugs of HIV viruses in a biological sample. 

10. A nucleic acid comprising a nucleotide sequence represented by any of the following SEQ ID 
numbers: SEQ ID NO 478, SEQ ID NO 479, SEQ ID NO 480, SEQ ID NO 481, SEQ ID NO 
482, SEQ ID NO 483, SEQ ID NO 484, SEQ ID NO 485, SEQ ID NO 486, SEQ ID NO 487, 
SEQ ID NO 488, SEQ ID NO 489, SEQ ID NO 490, SEQ ID NO 491, SEQ ID NO 492, SEQ 
ID NO 493, SEQ ID NO 494, SEQ ID NO 495, SEQ ID NO 496, SEQ ID NO 497, SEQ ID NO 
498, SEQ ID NO 499 and SEQ ID NO 500; 

or a fragment thereof, wherein said fragment consists of at least two contiguous nucleotides and 
contains at least one polymorphic nucleotide. 

11. A primer as defined in any of claims 4 to 8, for use in a method for determining the susceptibility 
to antiviral drugs of HIV viruses in a biological sample. 



WO 99/67428 



PCT/EP99/04317 



53 

A diagnostic kit enabling a method for determining the susceptibility to antiviral drugs of HIV 
viruses in a biological sample, with said kit comprising: 

a) when appropriate, a means for releasing, isolating or concentrating the polynucleic acids 
present in said sample; 

b) when appropriate, at least one of the primers of any of claims 4 to 6; 

c) at least one of the probes of any of claims 1 to 3, possibly fixed to a solid support; 

d) a hybridization buffer, or components necessary for producing said buffer; 

e) a wash solution, or components necessary for producing said solution; 

f) when appropriate, a means for detecting the hybrids resulting from the preceding 
hybridization; 

h) when appropriate, a means for attaching said probe to a solid support 
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GGA GGT TTT ATC AAA GTA AGA CAG 
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G A C G G A 
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